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Introduction 



Although this may seem a paradox, all exact science is dominated by the 
idea of approximation. (Bertrand Russel, logicien et prix Nobel) 

1 Approximation adaptative et anisotrope par ele- 
ments finis : pourquoi et comment ? 

Cette these est consacree au probleme de l'approximation de fonctions par des ele- 
ments finis polynomiaux par morceaux sur des triangulations, et sur des maillages plus 
generaux. Nous nous interessons tout particulierement a la situation oil le maillage est 
construit en adaptation avec la fonction approchee. Ce maillage peut done comporter des 
elements de taille, de rapport d'aspect et d'orientation fortement variables. 

L'approximation par des fonctions polynomiales par morceaux est une procedure qui 
intervient dans de nombreuses applications. Dans certaines d'entre elles comme la com- 
pression de donnees de terrain, d'images ou de surfaces, la fonction / approchee peut etre 
connue exactement. Dans d'autres applications comme le debruitage de donnees, l'ap- 
prentissage statistique ou la discretisation par elements finis d'Equations aux Derivees 
Partielles (EDP), la fonction approchee n'est connue que partiellement, voire totalement 
inconnue initialement. Dans toutes ces applications, on etablit usuellement une distinc- 
tion entre l'approximation uniforme ou adaptative. Dans le cadre uniforme, le domaine 
de definition de la fonction est decompose en une partition formee d'elements de taille 
et de forme comparables, alors que ces attributs peuvent varier fortement dans le cadre 
adaptatif. La partition peut dans ce dernier cas etre adaptee aux proprietes locales de la 
fonction /, dans l'objectif d'optimiser le compromis entre la precision et la complexite de 
1 ' approximat ion . 

Du point de vue de la theorie de l'approximation, ce compromis entre precision et 
complexite est generalement lie a la regularite de la fonction : on s'attend typiquement a 
des vitesses de convergence plus elevees pour des fonctions plus regulieres. Les fonctions qui 
se presentent dans les applications concretes peuvent cependant presenter des proprietes 
heterogenes de regularite, dans le sens ou elles sont regulieres dans certaines regions, 
qui separent des discontinuity localisees. Deux exemples typiques, illustres Figure [T] et 
Figure |2j sont (i) les bords des objets dans les fonctions qui representent des images, 
et (ii) les chocs dans les solutions d'EDP hyperboliques et non-lineaires. Les methodes 
numeriques destinees au traitement de l'image, pour le debruitage ou la compression 
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Figure 1 - Une fonction reguliere par morceaux (haut, gauche), Partition uniforme du 
domaine de definition (haut, droite), Partition associee a 1' approximation adaptative fon- 
dee sur les plus grands coefficients du developpement de la fonction sur la base de Haar 
(bas, gauche), Partition associee a 1' approximation adaptative anisotrope de la fonction 
par des elements finis (bas, droite). (source : G. Peyre) 

notamment, ou dediees a la simulation des EDP beneficient grandement de representations 
economiques et precises de telles fonctions. Une approximation polynomiale par morceaux 
sur une partition uniforme du domaine n'est generalement pas suffisante a cet effet. Un 
premier pas vers l'adaptativite consiste a faire varier la taille des elements qui forment 
la partition en fonction des proprietes locales de regularity de la fonction. Un procede 
tres similaire, souvent utilise en traitement de l'image et illustre Figure [I] (bas, gauche), 
consiste a retenir les termes d'une decomposition en ondelettes de / qui correspondent 
aux plus grands coefficients. Un second pas dans l'adaptativite est de remarquer qu'une 
plus grande resolution de la partition est requise dans la direction orthogonale a la courbe 
de discontinuity que dans la direction tangentielle, et de tirer parti de cette propriete 
en employant une partition anisotrope du domaine. En deux dimensions ces partitions 
sont typiquement construites a l'aide de triangles de fort rapport d'aspect alignes avec les 
discontinuites, comme illustre Figure [I] (bas droite) et Figure [2j 

Dans le contexte de la simulation numerique des EDP, l'adaptativite signifie egalement 
que le maillage de calcul n'est pas fixe a priori, mais dynamiquement mis a jour au cours 
de la simulation a mesure que la solution exacte se devoile. D'un point de vue numerique 
ces methodes requierent des algorithmes et des structures de donnees plus complexes que 
leurs pendants non-adaptatifs. D'un point de vue theorique l'analyse de ces algorithmes 
est difficile, lorsqu'elle est possible. On ne sait en fait rigoureusement etablir que les me- 
thodes adaptatives ameliorent la vitesse de convergence des solutions approchees vers la 
solution exacte, que pour un nombre reduit de systemes d'EDP et seulement dans le cas de 



Figure 2 - Flux d'air autour d'un avion supersonique, calcule a l'aide d'un maillage 
tridimensionnel fortement anisotrope. (source : F. Alauzet [20].) 

l'adaptation isotrope de maillage. Nous referons au survey [77] pour une vue d'ensemble 
de ces resultats dans le cas des equations elliptiques. Nous devons mentionner que ces 
difficultes sont accentuees lorsque des elements finis anisotropes sont utilises. Plusieurs 
logiciels comme |20j|93}[94) utilisent cependant avec succes les maillages anisotropes pour 
la simulation numerique des EDP, comme illustre par exemple Figure [2j D'un point de vue 
numerique l'amelioration apportee par ces methodes semble evidente en comparaison avec 
les methodes non-adaptatives ou adaptatives isotropes. Cependant de nombreux aspects 
de l'analyse theorique de ces methodes restent des questions ouvertes. 

Cette these etudie le probleme de l'adaptation de maillage anisotrope pour l'approxi- 
mation d'une fonction connue. Ceci peut etre considere comme une etape preliminaire 
pour l'analyse de l'adaptation anisotrope de maillage dans la simulation des EDP, mais 
d'autres applications peuvent en tirer parti comme le traitement de donnees de terrain, 
de surfaces ou d'images. 

Etant donnee une triangulation T d'un domaine borne et polygonal Q C 1R 2 , et un 
entier fixe k > 1, nous notons V&(T) l'espace des elements finis de degre k sur T . L'espace 
Vfc(T) est forme de toutes les fonctions qui coincident sur chaque triangle T £ T avec un 
polynome de degre total k 

Vfc(T) := {g ; g ]T € P fc , T e T}. 

La dimension de Vfc(T) est de l'ordre de 0(k 2 #(T)). Etant donnee une fonction / : Q — > R 
et une triangulation T de Q, l'erreur de meilleure approximation de / dans Vfc(T) est 
definie par 

e T {f) x := inf \\f - g\\ x . (1) 

gevUT) 

La lettre X designe la norme ou la semi-norme dans laquelle l'erreur d'approximation 
11/ ~ Pllx est mesuree. Dans cette these nous restreignons notre attention a la norme LP 
et a la semi-norme W l,p , oil 1 < p < oo. Elles sont definies comme suit : 



\h\\LP(n) ■= [ J \h\ p J et \h\ w i, P{n) := I J \Vh\ 



V 

1 



avec la modification usuelle lorsque p = oo. Notons que Ton doit imposer la continuite 
globale de g dans la definition ci dessus de V&(T) lorsqu'on utilise la semi-norme W 1,p . 
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La meilleure approximation g e Vfe(T") de / peut etre calculee exactement dans le cas 
de la norme L 2 ou de la semi-norme W 1 ' 2 (ou H l ) : g est la projection orthogonale de / 
sur Vk(T) par rapport au produit scalaire associe a la norme d'interet. Dans le cas p ^ 2 
de normes non hilbertiennes la meilleure approximation de / est generalement difficile a 
calculer, mais des approximations "satisfaisantes" peuvent etre obtenues par differentes 
methodes. Si la fonction est continue, on peut utiliser l'interpolation de Lagrange, tandis 
qu'un operateur de quasi-interpolation est prefere pour les fonctions non-lisses, voir Cha- 
pitre[6j Plus generalement, si Pj est un operateur arbitraire de projection de l'espace X 
sur Vfc(T), il est aise de voir que Ton a pour toute / e X 

\\f-Prf\\x<Ce T (f)x, (2) 

ou C := 1 + ||Pr||x^x- Le probleme de 1' approximation d'une fonction / sur une trian- 
gulation donnee T, par des elements finis de degre k, est done en bonne partie resolu. 



Dans le contexte de l'approximation adaptative, la triangulation T du domaine VL 
n'est pas fixee, mais peut etre choisie librement en adaptation avec / (par contraste nous 
supposons toujours dans cette these que l'entier k est fixe, bien qu'arbitraire). Ceci nous 
mene naturellement a l'objectif de caracteriser et de construire un maillage optimal pour 
une fonction donnee /. Etant donnee une norme X d'interet et une fonction / a approcher, 
nous formulons le probleme de ['adaptation optimale de maillage, comme la minimisation 
de l'erreur d' approximation parmi toutes les triangulations de cardinality donnee. Nous 
defmissons done l'erreur de meilleure approximation adaptative comme suit : 

eN(f)x '■= inf e r (f)x = inf inf \\f - g\\x- (3) 

#(T)<N #{T)<N 9eV k (T) U 

Par contraste avec le probleme de la meilleure approximation par elements finis sur un 
maillage fixe, l'approximation adaptative et anisotrope n'est pas encore bien comprise. 
En particulier (i) comment le maillage optimal depend-il de la fonction f, et (ii) com- 
ment l'erreur optimale CN{f)x decroit-elle lorsque N augmented Ces problemes sont bien 
compris dans le cadre isotrope, ou l'optimisation est restreinte aux triangulations dans 
lesquelles les triangles satisfont uniformement une contrainte sur leur rapport d'aspect 

diam(T) 2 < C\T\ 

oil diam(T) et |T| representent le diametre et l'aire de T respectivement, et C > est 
une constante fixee. Dans le cadre general des triangulations potentiellement anisotropes, 
ce sont des problemes ouverts. 

Heuristiquement, la simplicite de Q en comparaison avec ^ vient ce que l'optimisa- 
tion est posee sur l'espace lineaire VkiT), et qu'une solution presque optimale peut done 
etre obtenue en appliquant a / un operateur de projection stable comme indique dans 
(J2|. Par contraste, le probleme d'optimisation ^ est pose sur la reunion d'espaces Vk(T) 
pour toutes les triangulations T satisfaisant #(T) < N. II s'agit done d'un probleme d 'ap- 
proximation non-lineaire. D'autres exemples de ce type de probleme sont l'approximation 
par les iV meilleurs termes dans un dictionnaire de fonctions, ou l'approximation par des 
fonctions rationnelles. Nous referons le lecteur a (42] pour un survey sur l'approximation 
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non-lineaire. 

L'objectif de cette these est de mieux comprendre le probleme d'adaptation de maillage 
optimale pose sur la classe entiere des triangulations potentiellement anisotropes. Les 
quatre parties de cette these sont consacrees respectivement aux quatre questions ci- 
dessous : 

I. Comment l'erreur d'approximation ejy(/)jc se comporte-t-elle dans le regime asymp- 
totique ou le nombre N de triangles tend vers l'infini, lorsque / est une fonction 
suffisamment reguliere? Nous etablissons dans ce contexte une caracterisation ma- 
thematique du maillage optimal, ainsi que des estimations precises superieures et 
inferieures de ejsf(f)x a l'aide de A" et de quantites qui dependent non lineairement 
des derivees de /. 

II. Quelles classes de fonctions gouvernent la vitesse de decroissance de ex(f)x lorsque 
N augmente, et sont en ce sens naturellement liees au probleme d'adaptation op- 
timale de maillage ? Nous pensons en particulier a ce qu'on appelle les fonctions 
cartoon, qui par definition sont regulieres excepte le long d'une famille de courbes de 
discontinuity. II s'agit d'un modele d'image populaire, illustre par exemple Figure [T] 
Nous verrons que ce modele s'inscrit naturellement dans une classe de fonctions plus 
riche qui correspond a une vitesse donnee de decroissance de e^(f)x- 

III. Le probleme d'optimisation pj, qui porte sur les triangulations T de cardinalite 
donnee N, peut-il etre remplace par un probleme equivalent mais plus accessible ? 
Les triangulations sont en effet des objets combinatoires et discrets, decrits par leurs 
sommets et aretes, ce qui est peu commode lorsque Ton resout des problemes d'opti- 
misation de la forme de (|3]). Nous etudions la correspondance entre certaines classes 
de triangulations et de metriques riemanniennes qui sont par contraste des objets 
continus. Ceci nous permet de reformuler le probleme original d'optimisation par un 
probleme plus facilement soluble pose sur l'ensemble des metriques riemanniennes. 

IV. Est-il possible de construire une suite quasi-optimale de triangulations (Tn)n>o, ou 
#(7jv) = N, en utilisant une procedure hierarchique de raffinement! La propriete 
de hierarchie garantit l'inclusion des espaces d'elements finis associes aux triangula- 
tions : Vfc(7jv) C Vfc(7jv+i)- Ehe est requise dans des applications comme le codage 
progressif ou le traitement de donnees en ligne (i.e. a mesure qu'elles sont trans- 
mises). Nous proposons un algorithme simple et explicite qui donne une reponse 
positive a cette question sous certaines conditions. 

2 Etat de Part 

Avant d'entrer dans le detail du contenu de la these, nous donnons dans cette section 
un apergu rapide de l'etat de l'art dans les deux problemes de l'etude de ex{f)x lorsque 
augmente et de la construction d'une triangulation proche de l'optimale. Nous devons 
pour cela introduire certaines notations. Nous supposons dans la suite que / G C°(0), 
ou Q C M 2 est un domaine polygonal borne et Q designe l'adherence de Q. Pour chaque 
triangulation T de nous notons l5- l'operateur usuel d'interpolation de Lagrange sur 
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les elements finis de degre k sur T : sur chaque triangle T e T Interpolation 1^- / est 
l'unique element de qui coincide avec / aux points de coordonnees barycentriques 
{0, |, • • • , ^i^, 1}- Lorsque k = 1 ces points sont simplement les trois sommets de T. Si 
la triangulation T est conforme (chaque arete de chaque triangle est soit sur le bord de 
f2, soit coincide avec l'arete entiere d'un autre triangle), alors 1^-/ est continu. Pour etre 
consistant avec le reste de cette these nous definissons m := k + 1 > 2, et nous avons done 

e T (/k P <||/-ir 1 /ib>(fi) et er(/V.,<||V/-V^- 1 /||iP (n) . 

Toute estimation sur l'erreur d'interpolation donne done une estimation superieure sur 
l'erreur de meilleure approximation ef{f)x- De plus si / est suffisamment reguliere et T 
suffisamment fine, alors ces quantites sont generalement comparables. 

L'un des resultats fondateurs en approximation adaptative et anisotrope par des ele- 
ments finis porte sur les elements finis affines par morceaux (m = 2) lorsque l'erreur 
est mesuree en norme L p , voir aussi |27| ou Chapitre [2] Ce resultat peut etre formule 
comme suit : pour tout domaine polygonal borne Q C R 2 , pour tout 1 < p < oo et pour 
toute fonction / e C 2 (Q), il existe une suite (Tn)n>n de triangulations de Q satisfaisant 
#(T N ) < N, et telles que 

limsupiV||/-I^/|U P(n) <C 
oil l'exposant r G (0, oo) est defini par 

r p 

et C est une constante universelle (C est independante de p, Q et /). Nous rappelons que 
la limite superieure d'une suite (un)n>n es ^ definie par 

limsupwAr := lim sup u n , (5) 

et est en general strictement inferieure au supremum sup Ar>A r o u^. Trouver une majoration 
pertinente de sup Ar>A r ||/ — 1^- f\\hp{vi) reste aujourd'hui un probleme ouvert lorsque des 
triangulations anisotropes adaptees de maniere optimale sont utilisees. Le resultat (HI) 
s'etend a l'exposant p = oo, et aux maillages simpliciaux de domaines Q de plus grande 
dimension, mais les maillages Tn peuvent ne pas etre conformes. 

Le resu ltat Q rev ele que la precision de 1' approximation de / est gouvernee par la 
quantite yj det(rf 2 /)| qui depend non lineairement de la matrice hessienne d 2 f. Cette de- 
pendance non lineaire est fortement liee au fait que nous autorisons des triangles de formes 
potentiellement fortement anisotropes. L'estimation d'erreur Q est obtenue en produi- 
sant des triangulations suffisamment fines qui combinent les deux proprietes heuristiques 
suivantes : 

a) Equidistribution des erreurs : la contribution ||/ — I r /||lp(t) de chaque triangle T e Tn 
a l'erreur d'interpolation globale ||/ — 1^ f\\LP(T) est du meme ordre. Cette condition 
se traduit par une contrainte locale sur l'aire des triangles, qui est dictee par le com- 
portement local de / et en particulier par det(d 2 f (z)) . 



det(d 2 /)l 



LT(n') 



(4) 



Figure 3 - Illustration de la strategie des patchs locaux : les regions fi, sont couvertes 
par des pavages periodiques, qui sont ensuite recolles. 



b) Forme optimale des triangles : le rapport d'aspect et l'orientation d'un triangle T G 7/v 
est dicte par le rapport des valeurs propres et par la direction des vecteurs propres de 
la matrice hessienne d 2 f(z) pour z G T. 

La methode la plus simple pour construire une suite (J~n)n>n de triangulations satis- 
faisant Q est d'utiliser une strategie de "patchs locaux" que Ton peut decrire intuitivement 
comme suit. Dans une premiere etape le domaine Q est decoupe en regions 1 < % < k, 
suffisamment petites pour que la matrice hessienne d 2 f(z) varie peu sur chaque fli autour 
d'une valeur moyenne Mj. En d'autres termes / est bien approchee par un polynome 
quadratique sur chaque fij. Chaque region fl^ est ensuite pavee par une triangulation uni- 
forme dont les mailles sont de taille, de rapport d'aspect et d'orientation dictes par 
Mi. Les triangulations de Q sont ensuite reco flees de maniere conforme pour former 
une triangulation 7jv de Q, au prix de quelques triangles supplementaires aux interfaces 
entre les 

Cette construction, illustree Figure [3] est suffisante pour etablir le resultat asympto- 
tique (|4]), mais pas pour des applications pratiques car elle ne devient efficace que pour un 
grand nombre de triangles. L'approche suivante, fondee sur les metriques riemanniennes, 
est souvent preferee dans les applications. Pour simplifier l'exposition nous supposons que 
la matrice hessienne M(z) := d 2 f(z) est definie positive en chaque point z G f2, et nous 
defmissons 

H{z) := A 2 (det M(z))-*&M(z) (6) 

ou A > est une constante dont le role est de controler la resolution de la triangulation. 
Une telle fonction H, qui associe confinement a chaque point z G £1 une matrice syme- 
trique definie positive H(z) G S}, est appelee une metrique riemannienne. Notez que pour 
chaque z G Q, la matrice H(z) definit une ellipse 

£ z := {wet 2 ; u T H{z)u < 1}. 

Comme illustre Figure [4], partie gauche, une metrique encode a chaque point z G Q 
une information d'aire, de rapport d'aspect et d'orientation, sous la forme d'une matrice 
symetrique definie positive H(z) ou de maniere equivalente d'une ellipse £ z . Plusieurs 



algorithmes de generation de maillage, comme [93,94 , sont capables de produire une 
triangulation adaptee a la metrique z i-> H(z), dans le sens ou pour chaque z G VL le 
triangle T G T contenant z a une forme "similaire" a l'ellipse £ z comme illustre Figure |4| 
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Figure 4 - Une metrique et une triangulation adaptee. (source : J. Schoen (84 



partie droite. En termes mathematiques, cela signifie que pour chaque triangle T £ T et 
chaque z G T, Ton a 



&t + Ci£ 2 CT C b T + c 2 £ z 



(7) 



oil < c\ < C2 sont des constantes fixees et br designe le barycentre de T, ce qui signifie 
aussi que T est "proche" d'etre un triangle equilateral dans la metrique H(z). Si T est 
une telle triangulation, et si A est suffisamment grand, un argument heuristique (qui sera 
rappele Chapitre[2| montre que 



#(r)||/-I^/|k(n)<C||^detp 



L T (f2), 



oil la constante C depend de C\ et c 2 . 

En faisant varier le parametre A nous obtenons differentes triangulations 7a, de car- 
dinality proportionnelle a A 2 , ce qui mene a l'estimation d'erreur Q. Mentionnons que 
la construction du maillage T a partir de la metrique H n'est pas evidente. De plus il 
n'existe pas de preuve rigoureuse que la condition de similarite ([7]) est verifiee par les al- 
gorithmes de generation de maillage les plus courants, a l'exception notable de 66 et [TBI. 



La borne d' approximation Q est optimale si Ton se restreint aux triangulations qui 
satisfont une condition technique definie comme suit. Nous disons qu'une suite (J~n)n>n 
de triangulations d'un domaine polygonal Qcl 2 est admissible si #(7jv) < iV et si 

sup I y/~N max diam(T) I < oo. (8) 

N>N V t GTjv / 

Pour toute suite admissible (Tn)n>n de triangulations de fl, on peut etablir la minoration 



liminf N\\f 



r N J \\lp(q) 



> c 



det(rf 2 /)l 



L r (n) 



(9) 



ou la constante c > est universelle (voir aussi Theoreme [2] dans le plan de la these 
ci dessous). De plus la condition d'admissibilite n'est pas trop restrictive : pour chaque 
e > il existe une suite admissible de triangulations (7~n)n>n qui satisfait la majoration 
d'erreur Q a la constante e pres ajoutee au terme de droite. 



Des resultats similaires a Q et (|9j peuvent etre developpes pour les maillages isotropes, 
dans lesquels la taille des triangles peut varier mais pas leur forme, en ce sens que la mesure 
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de degenerescence p(T) := diam(T) 2 /|T| est uniformement bornee par une constante po, 
voir par exemple [35]. Dans ce cas l'estimation Q doit etre remplacee par 

limsupiVH/ - 4 f\\ LP(n) < C\\d 2 f\\ L r (nh (10) 

iV-s>oo 

avec la meme valeur de r, et restimation Q a un pendant similaire. Les constantes C 
et c apparaissant dans ces estimations dependent desormais de la borne po sur la mesure 
de degenerescence. Ainsi la quantite non lineaire \J\ det(d 2 f)\ est remplacee par le terme 
lineaire cPf dans la norme L T , et ces resultats sont desormais tres similaires a ceux de 
meilleure approximation par N ondelettes |30| . 

En termes des valeurs propres Xi(z), \2{z) de la matrice symetrique d 2 f(z) nous rem- 
plagons done la moyenne geometrique y |Ai(z)A2(^)| par max{|Ai(z)|, ^2(2) I }> qui peut 
etre significativement plus grand quand ces valeurs propres sont d'ordres de grandeur dif- 
ferent s. C'est le cas typiquement si la fonction / approchee presente des caracteristiques 
fortement anisotropes, et nous pouvons done nous attendre a une amelioration substan- 
tielle des proprietes d'approximation lorsque des maillages anisotropes sont utilises pour 
de telles fonctions. 

Le resultat (HJ) donne un compte rendu precis de l'amelioration que peuvent appor- 
ter des triangulations anisotropes en comparaison avec les triangulations isotropes, mais 
malheureusement seulement dans un cadre restreint, ce qui a motive notre travail : 

I. Le resultat original ne s'applique qu'a l'erreur d'interpolation lineaire mesuree en 
norme L p , alors que les elements finis de plus haut degre et les normes de Sobo- 
lev W 1,p sont aussi pertinents. En particulier la norme W 1,2 (ou if 1 ) apparait tres 
naturellement dans le contexte des EDP elliptiques. 

II. La fonction approchee / doit etre C 2 , alors que les applications les plus interessantes 
de l'approximation adaptative font intervenir des fonctions non lisses voire disconti- 
nues. Quel sens peut-on donner a la quantite || y/\ det(rf 2 /)| \\l t (q) lorsque / est une 
fonction discontinue ? 

III. La metrique riemannienne z 1— > H(z) est utilisee comme objet intermediaire pour la 
generation de maillages dans les applications numeriques. Cette approche manque 
cependant d'un resultat precis d'equivalence entre ces objets continus et les diffe- 
rentes classes de triangulations anisotropes. Sous quelles conditions peut-on associer 
a une metrique z 1— > H(z) une triangulation T qui lui est adaptee au sens de <Q ? 

IV. Les algorithmes precedemment mentionnes de generation de maillages anisotropes 
ne sont pas hierarchiques, dans le sens ou une meilleure precision n'est pas atteinte 
par un raffmement local mais par la re-generation globale d'un nouveau maillage. 
Peut-on proposer un algorithme de raffmement hierarchique qui permet d'obtenir 
l'erreur d'approximation optimale Q ? 

Nous devons aussi mentionner deux problemes fondamentaux qui sont discutes dans 
cette these, mais qui sont restes des problemes ouverts et feront l'objet de travaux fu- 
ture. En premier lieu l'estimation d'erreur Q ne donne qu'une information asymptotique, 
lorsque le nombre de triangles tend vers 00, alors qu'une estimation d'erreur portant sur 
toutes les valeurs de N est fortement souhaitee. En second lieu l'extension de ce resultat 
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a des fonctions definies sur des domaines de dimension superieure a deux est entravee par 
un probleme difficile de geometrie algorithmique : la production de maillages conformes 
et anisotropes en dimension 3 ou plus elevee. Des methodes numeriques comme 94 s'at- 
taquent a ce probleme, mais il n'est pas resolu d'un point de vue theorique. 



3 Plan de la these 



La these est constitute de quatre parties qui visent a resoudre les quatre problemes 
clefs, numerates I a IV, que nous avons rencontres dans les resultats anterieurs sur l'ap- 
proximation adaptative et anisotrope par elements finis. Ces quatre parties sont essentiel- 
lement auto-consistantes et peuvent done etre lus independamment. Les chapitres consti- 
tuant chaque partie gagnent a etre lus dans l'ordre, a l'exception du chapitre 1 dont la 
lecture peut etre ignoree par le lecteur souhaitant aller plus rapidement au coeur du sujet. 
Les chapitres 1, 2, 3, 4, 7, 8, ainsi que la troisieme partie du chapitre 9, sont respecti- 

Les notations utilisees dans 



vement issus des articles 9 , 72 , 73 , 37 , 34 , 36 



et 



35 



ces articles ont ete unifies pour la clarte de l'ensemble. Les chapitres 5 et 6 sont l'objet 
d'articles en preparation. 



Partie I. Elements finis de degre arbitraire, et normes de Sobolev 

Nous generalisons dans cette partie l'estimation d'erreur asymptotique Q aux ele- 
ments finis de degre arbitraire et aux semi-normes de Sobolev W 1,p pour la mesure de 
l'erreur. Cette analyse nous amene a introduire des concepts clefs pour l'adaptation opti- 
male de maillage. 

Pour commencer nous considerons dans le Chapitre [T] des partitions d'un domaine rec- 
tangulaire en rectangles alignes avec les axes de coordonnees, comme illustre Figure |5j a 
la place de triangles de directions arbitraires. De telles partitions sont pertinentes lorsque 
les axes de coordonnees jouent un role privilegie, de sorte que les traits anisotropes de 
la fonction / sont alignes avec les axes de coordonnees. Nous obtenons une estimation 
d'erreur asymptotique optimale dans ce contexte. Nos resultats s'appliquent a des poly- 
nomes par morceaux de degre arbitraire, en dimension quelconque d > 1, lorsque l'erreur 
d' approximation est mesuree en norme L p . Nous ne considerons pas ici les normes W l,p 
car ces approximations polynomials par morceaux sont generalement discontinues. 

Le principal avantage de ce cadre est que les details techniques requis pour la construc- 
tion d'une partition anisotrope du domaine, ainsi que l'analyse d'erreur, sont simplifies 
par la presence de directions privilegiees. Nous tirons avantage de ce contexte simple pour 
introduire et etudier un concept clef appele la fonction de forme, ou shape function en 
anglais, qui gouverne l'erreur d'approximation locale apres une adaptation optimale des 
elements de la partition aux proprietes locales de la fonction approchee. Cette fonction 
de forme est aussi definie et utilisee dans les Chapitres 2 et 3 pour des triangulations 
anisotropes. Nous donnons ci-dessous sa definition precise dans ce cadre. 

Le Chapitre [2] est consacre aux elements finis triangulaires de degre arbitraire m — 1 , 
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Figure 5 - Partition anisotrope d'un domaine en rectangles. 



ou m > 2, lorsque l'erreur est mesuree en norme LP . Pour presenter nos resultats, nous 
devons introduire quelques notations. Nous notons H m l'espace vectoriel des polynomes 
homogenes de degre m : 



Span{x k y l ; k + I = m}. 



Un ingredient clef de notre approche est la fonction de forme K m<p : H m — > IR + , ou 
1 < p < oo est un exposant donne. Cette fonction est definie par une optimisation de 
l'erreur LP d'interpolation parmi les triangles d'aire 1 de toutes les formes possibles : pour 

tOUt 7T G H m , 

(11) 



inf Ik - I™ 1 it\\lp(t), 
\t\=i K ' 



ou I™ 1 designe l'operateur d'interpolation locale sur T. Notre resultat principal est l'es- 
timation d'erreur asymptotique suivante. 

Theoreme 1. Soit Q C 1R 2 un domaine polygonal borne, soit f G C m (Q) et soit 1 < p < 
oo. i7 existe une suite (Tn)n>n > #(7jv) < N, de triangulations conformes de Q telles que 



limsupiVT||/-I--i/|U ! > ( n) 

N— >oc 



< 



K, 



m,p 



oTf 



(12) 



L T {Q) 



oil V exposant r G (0, oo) est defini par - :- 



mil 
2 p- 



Dans l'estimation d'erreur (12), nous identifions la donnee d m f(z) des derivees d'ordre 
m de / au point z au polynome homogene qui lui correspond dans le developpement de 
Taylor de / en z. Sous forme mathematique 



d m f(z) 



ml 



E 

k+l=m 



d m f x k y l 
d k xd l y^ldl\' 



Le Theoreme [I] etend le resultat connu (Q aux elements finis de degre arbitraire m — 1. 
De maniere similaire, la qualite de 1' approximation adaptative et anisotrope de / est deter- 
minee par une expression non-lineaire des derivees de / : la fonction de forme K m p (d m f(z)) 
est la "generalisation" aux derivees d'ordre superieur du determinant y | det{d 2 f {z))\ ap- 
paraissant dans Q. 
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Ces resultats motivent une etude approfondie de la fonction de forme K miP . Nous avons 
montre que K 2>p , qui correspond au cas m = 2 de 1' approximation affine par morceaux, 
est proportionnel a la racine carree du determinant du polynome quadratique u = ax 2 + 
2bxy + cy 2 G H 2 : 

K 2 , p (7t) = c 2)P v1 det7r| = c 2 , p ^/\ac - b 2 \, 

oil la constante c 2iP > depend seulement du signe de det tt. Nous retrouvons done le 
resultat anterieur Q. Dans le cas m = 3 de 1' approximation quadratique par morceaux, 
nous montrons que la fonction de forme K^ tP est la racine quatrieme du discriminant du 
polynome cubique homogene n = ax 3 + 3bx 2 y + 3cxy 2 + dy 3 G H3 : 

Kz, p {^) = C3 >p \f\ disc 7r\ = cs i p\/\4(ac — b 2 )(bd — c 2 ) — (ad — bc) 2 \, 

oil la constante c? JtP > depend seulement du signe de disc tt. Pour les plus grandes valeurs 
de m, m > 4, nous n'avons pas obtenu d'expression explicite de la fonction de forme, mais 
une quantite explicite qui lui est uniformement equivalente. Cet equivalent a la forme 
suivante : il existe un polynome Q m (a , • • ■ , a m ) des m + 1 variables a , ■ • • , a m , et une 
constante C m > 1 telle que pour tout tt = aox m + a\x m ~ l y + • • • a m y m G H m on ait en 
notant r m := degQ m , 



C' 1 r ^/Q m (a ,--- ,a m ) < K nhP {n) < C m r ^Q m (a ,--- ,a m ). (13) 
Le polynome Q m s'obtient a l'aide de la theorie des polynomes invariants developpee par 



Hilbert dans 61 . Nous caracterisons aussi les zeros de la fonction de forme, et done les 
cas possibles de "super-convergence" : K m ^{ji) = si et seulement si n se factorise par 
un facteur lineaire ax + by de multiplicite s > m/2, en d'autres termes si le polynome 
homogene it est suffisamment degenere. 

La preuve du Theoreme [l] est fondee sur la "strategie des patchs locaux" qui a ete 
evoquee precedemment : on considere en premier lieu une "macro-triangulation" TZ du 
domaine initial Q, qui est suffisamment fine pour que les derivees d'ordre m de f varient 
peu sur chaque triangle R G 1Z autour d'une valeur moyenne n R G H m . A chaque poly- 
nome tt r , R G TZ, on associe ensuite un triangle T R qui minimise, ou presque, le probleme 
d'optimisation defmissant K m ^{n R ). On pave ensuite chaque "macro-triangle" R G 1Z de 
maniere periodique en utilisant le triangle Tr convenablement mis a l'echelle et son sy- 
metrique par rapport a l'origine. Finalement, comme illustre Figure [3j la triangulation 
7/v est obtenue en recollant ensemble les pavages periodiques dermis sur chaque R G TZ, 
a l'aide de quelques triangles supplementaires aux interfaces pour obtenir un maillage 
globalement conforme. 



Le theoreme suivant etablit que l'estimation asymptotique (12) est optimale, si l'on 
se restreint aux suites admissibles de triangulations qui sont defmies par la condition ([8]). 
Ce theoreme montre de plus que la condition d'admissibilite n'est pas trop restrictive. 

Theoreme 2. Soit Q cl 2 un domaine polygonal borne, soit f G C m (Q) et soit 1 < p < 
00. Soit (Tn)n>n > #(7/v) < N, une suite admissible de triangulations de Q. Alors 

lmiMNf\\f-I^f\\ LP(n) > K m J^() , (14) 
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Figure 6 - Interpolation d'une fonction parabolique bi-dimensionelle sur un maillage 
compose de triangles aigus (gauche), ou bien "plats" et fortement obtus (droite). 



ou 



+ - . De plus pour chaque e > il existe une suite admissible (7^) 



N>N 



de 



triangulations de Vt, #(7jv) < N, telle que : 



limsupiV?||/-I^ 1 /ll 



N- 



< 



K, 



m.p 



oTf 



ml 



+ £. 



(15) 



L T (f2) 



Le Chapitre [3] est consacre aux versions des Theoremes [T] et [2] lorsque l'erreur d'inter- 
polation est mesuree dans la semi-norme de Sobolev W 1,p , 1 < p < oo. Ces estimations 
font intervenir l'analogue L m>p de la fonction de forme K m<p qui est defini comme suit : 
pour tout 7r G H m 



inf 

|T[=1 



V(tt-I 



T 



7T 



\LP(T)- 



Nous donnons de nouveau des equivalents explicites de L mjP (7r), definis de la meme maniere 



que (13) par une expression algebrique en les coefficients de n G H m . Nos resultats pour 



les semi-normes W 1 ' p sont done extremement similaires aux resultats obtenus pour les 
normes LP . 

L'adaptation des preuves n'est en revanche pas evidente, a cause du phenomene sui- 
vant : la presence de triangles fortement obtus (avec un angle proche de 7r) dans un 
maillage peut causer des oscillations du gradient de l'interpolation d'une fonction, comme 
illustre Figure |6j Ce phenomene deteriore l'erreur d'interpolation dans la semi-norme W 1,p , 
mais pas dans la norme L p . Ces triangles "plats" doivent done etre evites avec precaution. 
En resume, les triangles longs et fins peuvent etre souhaitables mais il ne doivent pas etre 
trop fortement obtus. 

Avant de continuer la description de cette these, nous rappelons au lecteur que les 
trois chapitres qui composent la Partie |T] sont auto-consistants et peuvent done etre lus 
independamment . 



Partie II. Classes d'approximation anisotropes et modeles d'images 

Dans cette partie, formee de l'unique Chapitre 4, nous discutons de l'extension de nos 
resultats d'approximation aux fonctions non lisses. 

II existe des moyens varies de mesurer la regularite de fonctions definies sur un domaine 
O C R 2 , le plus souvent au moyen d'un espace de regularite approprie et d'une norme 



14 



Introduction 



associee. Des exemples classiques sont les espaces de Sobolev et de Besov. Ces espaces 
sont souvent utilises pour decrire la regularite de solutions d'EDP. D'un point de vue 
numerique ils caracterisent precisement la vitesse a laquelle une fonction / peut etre 
approchee par des fonctions plus simples comme les series de Fourier, les elements finis 
(sur des triangulations isotropes), les fonctions splines ou les ondelettes. 

Le resultat d'approximation adaptative anisotrope Q et sa generalisation par le Theo- 
reme [I] indiquent que la qualite de 1' approximation d'une fonction / par des elements finis 
sur des triangulations anisotropes est gouvernee par une quantite non-lineaire de ses de- 
rivees, d'un point de vue asymptotique du moins. Dans le cas des elements finis de degre 
1, et de l'approximation en norme L 2 , la quantite pertinente est la suivante 



A(J):= V\det(d*f)\ 



L 2 / 3 (Q) 



La fonctionnelle A differe fortement des normes de Sobolev, Holder ou Besov, car elle est 
fortement non lineaire : A ne satisfait pas l'inegalite triangulaire, ni aucune quasi-inegalite 
triangulaire. En d'autres termes pour chaque constante C il existe /, g G C 2 (Q) telles que 

A(f + g)>C(A(f) + A(g)). (16) 

L'absence d'une inegalite triangulaire interdit d'utiliser les techniques classiques de l'ana- 
lyse lineaire pour definir un espace de regularite attache a la fonctionnelle A. L'extension 
du resultat d'approximation (Jil) aux fonctions qui ne sont pas C 2 n'est done pas evidente. 

Les fonctions qui se presentent dans les applications concretes, comme par exemple en 
traitement de l'image ou comme solutions d'EDP hyperboliques, presentent souvent des 
zones de regularite separees par des discontinuites localisees. Un modele mathematique 
simple pour ce type de comportement est donne par la classe des fonctions cartoon, qui 
sont regulieres excepte le long d'une famille de courbes elles memes regulieres, a travers 
lesquelles elles sont discontinues. Une analyse heuristique presentee dans le Chapitre [4] 
suggere que pour toute fonction cartoon / definie sur un domaine polygonal borne Q, il 
existe une suite (J~n)n>n de triangulations anisotropes de fl, #(7jv) < N, telles que 

sup iV||/-I^/|| £a( „ ) <oo. (17) 

Af>iVo 

Comme illustre Figure [TJ les triangulations (Tn)n>n se composent de triangles fortement 
anisotropes alignes avec les discontinuites de /, et de grands triangles dans les regions ou 



/ est reguliere. Le resultat d'approximation (17) fait esperer qu'il existe une estimation 
d'erreur asymptotique precise, pour l'approximation anisotrope des fonctions cartoon, qui 
etende le resultat Q connu lorsque / est C 2 , a savoir 

hmsupiVll/ - % f\\ LHQ) < CA(f). (18) 

Af-s>oo 

Nous n'avons rempli pour l'instant qu'une partie de ce programme : l'extension de la 
fonctionnelle A aux fonctions cartoon. Plus precisement, considerons une fonction (f e 
C°°(M 2 ) radiale, a support compact et d'integrale unite. Definissons tps(z) := ^V 9 (f ) 
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TV ~ A 2/3 > 1 TV ~ 1, A > 1 TV ~ 1, A > 1 

Figure 7 - Comportement des fonctionnelles TV et A sur differents types d'images. 



pour chaque 5 > et notons fg := f * ips la convolution de / avec </?,$. Nous prouvons 
que si / est une fonction cartoon, alors A(f$) converge lorsque S — > vers une expression 
explicite : 



lim A(f 5 ) 2 ' 3 



8^0 



V|det(cP/)| 



2/3 



L 2 / 3 (n\r) 
^/|det(d 2 /(z))|^ + Cfa) 



k|7 



n\r 



2/3 

L 2 /3(r) 

«(s)| 1 / 3 |7(s)| 2 / 3 ds. 



Nous avons note ici T la famille de courbes le long desquelles la fonction cartoon / est dis- 
continue, j(s) le saut de / en un point s G T, et k(s) la courbure de T en s. La constante 
C((p) est strictement positive et s'exprime explicitement en fonction de ip. 



L'extension de la fonctionnelle non lineaire A aux fonctions cartoon met en lumiere 



un lien, explore en £4.4, entre 1' approximation par elements finis sur des triangulations 
anisotropes et plusieurs autres domaines mathematiques. Nous pensons en particulier aux 
methodes presentees dans 22 de traitement de l'image invariant par transformation af- 
fine, et a la definition donnee dans 43 d'espaces de fonctions definis par la regularite des 
lignes de niveau de leurs elements. 



On peut aussi penser la quantite A comme le pendant "d'ordre deux" de la semi-norme 
TV de variation totale, une mesure de regularite definie en termes des derivees d'ordre un 
de / et qui est aussi finie lorsque / est une fonction cartoon. La variation totale joue un 
role central en traitement de l'image et dans l'analyse des equations de transport, deux 
domaines dans lesquels les fonctions regulieres par morceaux apparaissent naturellement. 
Lorsque / est une fonction cartoon, sa variation totale est donnee par la formule suivante : 

TV(/)= / \Vf(z)\dz+ [ | 7 ( a )|d«. 
Jn\r Jr 

Nous comparons le comportement des quantites A(f) et TV(/) pour differentes families 
de fonctions cartoon /. Les quantites TV(/) et A(f) 2 ^ 3 se revelent equivalentes lorsque / 
est la fonction oscillant de maniere lisse f(z) := cos(o;|2|), illustree Figure [7] (i), ou u est 
un grand parametre. En revanche, les discontinuites sont penalisees de maniere differente 
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par ces deux fonctionnelles. Pour une fonction "en escalier", comme illustre Figure [7] (ii), 
la variation totale TV reste bornee alors que A tend vers l'infini a mesure que le nombre 
de marches croit, a cause du terme de saut |7(s)| 2//3 dans l'integrale sur l'ensemble T des 
discontinuites. Par ailleurs, a cause du terme de courbure la fonctionnelle A est 

bien plus grande que TV pour les fonctions caracteristiques d'ensembles ayant un bord 
complexe ou oscillant comme illustre Figure [7| 

La fonctionnelle A peut done etre considered comme un modele d'image quantitatif : 
une image monochrome, decrite par sa luminosite / : [0, l] 2 — > [0, 1], est plausible si A(f) 
est suffisamment petit. A partir de l'approche introduite dans [70], nous proposons un al- 
gorithme de debruitage d'images utilisant un a-priori bayesien fonde sur ce modele. Dans 
sa version actuelle, cet algorithme n'est pas satisfaisant en terme de vitesse de conver- 
gence, et pour cette raison nous presentons uniquement des illustrations numeriques dans 
un cadre simplifie unidimensionel. 

Enfin nous etudions l'extension aux fonctions cartoon des autres quantites non-lineaires 
qui apparaissent en approximation par elements finis sur des triangulations anisotropes, 
comme la norme \\K m>p (d m f)\\ L r de la fonction de forme pour m > 2, ou l'analogue de 
cette quantite lorsque / est une fonction de plus de deux variables. 



Partie III. Generation de maillage anisotrope et metriques rie- 
manniennes 

Les triangulations sont des objets discrets de nature combinatoire : elles peuvent etre 
decrites par une famille de sommets et d'aretes les joignant. Cette description est fruc- 
tueuse pour la demonstration de resultats algebriques comme la formule d'Euler, ou pour 
le traitement informatise. Par contraste, comme explique precedemment, de nombreuses 



approches en adaptation anisotrope de maillage 15 20 66 se fondent sur un objet continu 
equivalent aux triangulations, a savoir une metrique riemannienne z h- > H(z). II s'agit en 
d'autres termes d'une fonction continue H de Q dans l'ensemble S£ des matrices syme- 
triques definies positives. Une fois cette metrique congue convenablement, un algorithme 
de generation de maillages a la charge de fabriquer une triangulation qui lui correspond 



dans le sens de ([7]). L'objectif de la Partie III est de formuler des resultats precis d'equi- 



valence entre certaines classes de triangulations et de metriques riemanniennes. Cette 
equivalence traduit les contraintes geometriques que satisfont les triangulations sous la 
formes de proprietes de regularite des metriques riemanniennes qui leur sont equivalentes. 



Pour enoncer nos resultats nous devons introduire certaines notations. Nous associons 
a chaque triangle T son barycentre G M 2 et la matrice symetrique definie positive 
Ht G S% telle que l'ellipse Et definie par 

S T := {26K 2 ; (z - z T fU T (z - z T )}, 

est l'ellipse d'aire minimale contenant T. Le point zt indique done la position de T, tandis 
que la matrice %t G decrit son aire, son rapport d'aspect et son orientation. 

Nous notons T la famille de toutes les triangulations conformes du domaine infini IR 2 . 
Le choix de considerer des triangulations infinies est guide par la simplicite, et un travail 
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futur sera consacre aux triangulations de domaines polygonaux bornes. Nous notons H : = 
C°(IR 2 , S2) la famille des metriques riemanniennes sur IR 2 . Une metrique H G H associe 
continument a chaque point z G M? une matrice symetrique definie positive H(z) G S^- 
Soit C > 1, nous disons qu'une triangulation T 6 T est C-equivalente a une metrique 
H G H si pour tout T e T et tout z G T nous avons au sens des matrices symetriques 

C- 2 H(z) <U T < C 2 H(z). 

Nous disons qu'une famille T« C T de triangulations est equivalente a une famille H* C H 
de metriques s'il existe une constante uniforme C > 1 telle que 

- Pour chaque triangulation T G T* il existe une metrique H G H* telle que T et H 
soient C-equivalents. 

- Pour chaque metrique if G H* il existe une triangulation T G T* telle que T et H 
soient C-equivalents. 

Nous considerons trois families pertinentes de triangulations de M 2 

Tj,c G T a ,c C T 9>c - 

qui dependent de maniere mineure d'un parametre C > 1. La famille T 9 c des trian- 
gulation etagees, en anglais graded, est definie par la condition suivante qui impose un 
minimum de consistance dans les formes des triangles voisins. Une triangulation T ap- 
partient a T g ,c si pour tous T,T' G T nous avons 

T n TV =^ C- 2 H T < K T ' < C 2 H T - 

Une triangulation appartient a la classe T iy c des triangulations isotropes si T est etagee, 
T G T g ,c, et si les elements de T sont suffisamment proches du triangle equilateral, ce 
qu'exprime la condition suivante : pour tout T G T 

H'HtIIII^t 1 !! < c 2 . 

La classe T a ,c des triangulations quasi-aigues est definie par une condition legerement plus 
technique sur les angles maximaux des triangles, voir Chapitre |5j Comme explique dans 
la description ci-dessus du chapitre 3, eviter les angles trop fortement obtus est necessaire 
pour garantir la stabilite du gradient lorsqu'on applique l'operateur d'interpolation. L'un 
de nos resultats clefs est la reformulation de cette condition sous la forme d'une hypothese 
de regularity de la metrique riemannienne equivalente. D'un point de vue pratique, cette 
condition n'est malheureusement pas garantie par les programmes existants de generation 
de maillage. Les contraintes satisfaites par ces families de triangulations de M 2 sont illus- 
trees pour des triangulations de domaines bornes sur la Figure [8] 

Les resultats du Chapitre [5] etablissent que lorsque C est suffisamment grand, ces trois 
classes de triangulations sont equivalentes a trois families de metriques, respectivement 



Hj C H„ C H 9 , 
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qui sont defmies par des conditions de regularities precises sur la fonction z h- > H(z). En 
particulier la famille Hj est formee des metriques H qui sont proportionnelles a l'identite 
H(z) = Id /s(z) 2 , et telles que le facteur de proportionnalite s satisfait l'une des conditions 
suivantes qui de maniere surprenante sont equivalentes 

• Propriete de Lipschitz Euclidienne : \s(z) — s(z')\ <\z — z'\ pour tous z, z' G M. 2 . 

• Propriete de Lipschitz Riemannienne : |lns(z) — In s(z') \ < cIh(z,z') pour tous z,z' G 
M 2 . 

Rappelons que la distance riemannienne dff(z,z') mesure la longueur, au sens de la me- 
trique H, du plus court chemin joignant z k z' : 

d H (z,z'):= inf / \\j'(t) \\ H (^t))dt 

7(0)=z J 
y(l)=z> 

oil \\u\\m '■— \/u T Mu et ou rinfimum est pris parmi tous les chemins 7 G C 1 ([0, 1], lR d ) 
joignant z a z'. 

Les deux proprietes de Lipschitz presentees ci-dessus s'etendent naturellement aux 
metriques riemanniennes generales, mais ne sont alors plus equivalentes. Pour if G H, on 
pose S{z) := H(z)~2 y et on introduit les deux proprietes distinctes 

d+(H(z),H(z')) < \z - z'\ pour tout z, z G M 2 , (19) 

oil d+(H(z),H(z')) := \\S(z) -S(z')\\, et 

d x {H{z), H{z')) < d H (z, z') pour tout z, z' G 1R 2 . (20) 

ou d x (H(z),H(z')) := \nma 1 x{\\S(z)S(z')- 1 \\, WSiz^Siz)- 1 ]]}. Notons que d + et d x sont 
des distances sur S£- 

Le resultat principal du Chapitre [5] caracterise les families de metriques H 9 et H a 
(associees aux families T 9j c des triangulations etagees, et T a ^c des triangulations quasi- 
aigues) sous la forme des proprietes de Lipschitz ci-dessus. 

Theoreme 3. Si la constante C est suffisamment grande, alors la famille T g ,c des tri- 
angulations etagees est equivalente a la famille H 9 des metriques qui satisfont (20), et la 
famille T a ,c des triangulations quasi- aigues est equivalente a la famille H a des metriques 



qui satisfont simultanement [20] ) et (19). 

Nous donnons Chapitre [6] des applications de ces resultats dans le contexte de la 
theorie de 1' approximation et de la generation contrainte de maillage. Decrivons ce dernier 
exemple. Pour chaque ensemble ferme E C M d et chaque triangulation T G T nous notons 
Vj-(E) le voisinage de E dans la triangulation T, qui est defini comme suit 

V T (E):= |J T. 

TeT 

Nous disons qu'une triangulation T G T separe deux ensembles fermes et disjoints 
X, Y C M. 2 si Vj-(X) fl Vr(Y) = 0. La propriete analogue pour les metriques est la 
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Figure 8 - Generation d'une couche de triangles de largeur 5 le long du cercle unite 
en utilisant 0(<5 _1 ) triangles isotropes (haut gauche), 0{8~^ \ In «5| ) triangles "quasi-aigus" 
(haut droit), 0(5~z) triangles satisfaisant, ou non, la condition d'etagement (bas gauche 
et bas droit). 

suivante : une metrique H G H separe X et Y si dn(x, y) > 1 pour tous x G X et y G Y. 
Nous montrons que ces deux proprietes sont rigoureusement equivalentes, et nous utilisons 
cette reformulation pour calculer le plus petit nombre de triangles (a periodicite pres) re- 
quis pour separer des ensembles (periodiques) en utilisant une triangulation (periodique) 
isotrope, quasi-aigue ou etagee. Comme illustre sur la Figure [8| imposer davantage de 
contraintes sur la triangulation augmente typiquement le nombre de triangles requis pour 
realiser la meme tache. 

La suite de ce chapitre est consacree au controle de l'erreur d' approximation par ele- 
ments finis d'une fonction sur une triangulation T, en norme LP ou W l ' p , par une quantite 
Ci?(/)p, e^(V/) p ou e 9 H (Vf) p attachee a une metrique H equivalente a T. Cette analyse 
fait apparaitre, dans le cas des normes de Sobolev, le role particulier joue par les conditions 
d'angle et de regularite qui definissent les triangulations et metriques appartenant a T a 
et H a respectivement. Finalement nous etendons aux metriques les estimations d'erreur 
asymptotiques developpes pour des triangulations Chapitres 2 et 3. 

Partie IV. Algorithmes de raffinement hierarchique 

La derniere partie de cette these est consacree a l'etude d'un algorithme propose par 
Cohen, Dyn et Hecht qui produit des suites hierarchiques (J~n)n>n de triangulations 
anisotropes (non-conformes) adaptees a une fonction donnee /. Etant donnee une trian- 
gulation T d'un domaine Q et une fonction / G L P (Q), cet algorithme cree en une etape 
une triangulation T 1 de Q, de cardinalite #(T') = #(T) + 1, de la fagon suivante : 

1. (Selection "greedy" du triangle a raffiner) On selectionne un triangle T G T dont la 
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FIGURE 9 - L'algorithme de raffinement : choix du triangle qui maximise l'erreur d'ap- 
proximation (i, en sombre), choix d'une bissection parmi les trois possibilities (ii, iii), 
iteration de ces deux etapes (iv). 

contribution a l'erreur d'approximation est maximale 

T := argmax ||/ - A T 'f\\LP{T>), 
T'eT 

ou At '■ L P (T) — >■ P m _i est un operateur de projection, par exemple la projection 
L 2 (T) orthogonale sur P m _i. 

2. (Choix d'une bissection) Une arete e G {a, b, c} de T est choisie en minimisant une 
fonction de decision donnee e i-> dr(e, /) parmi les trois aretes. Le triangle est 
decoupe le long du segment joignant le point milieu de l'arete choisie au sommet 
oppose, ce qui cree les sous-triangles T} et T e 2 . La nouvelle triangulation est done 

r :=r-{T} + {T e 1 ,T e 2 }. 

Partant d'une triangulation To de cardinality Nq du domaine Q, l'algorithme produit pas 
apres pas une suite (Tn)n>o, voir Figure [9j de triangulations "adaptees" a une fonction 
donnee / G L P (Q). Les proprietes de ces triangulations dependent fortement de la fonc- 
tion approchee / et du choix de la fonction de decision d T (e, f), qui guide la creation de 
l'anisotropie. Par contraste l'operateur de projection At joue un role plutot mineur. Un 
choix typique de la fonction de decision e (-)■ dr(e, f) est l'erreur locale apres bissection, 
en d'autres termes l'algorithme choisit la bissection qui reduit le plus possible l'erreur 
d ' appr oximat ion . 

Nous decrivons cet algorithme plus en detail dans le Chapitre [7J et nous etablissons sa 
convergence au sens ou les approximations polynomiales par morceaux Ar N f definies par 

A TN f{z) =A T {z), zeT, TeT N 

convergent vers / dans LP lorsque N — > oo pour n'importe quelle / G L p , sous certaines 
hypotheses sur la fonction de decision e (-)■ <i T (e, /). Nous discutons aussi de la possibilite 
d'utiliser la structure hierarchique multi-echelle pour definir des approximations multi- 
resolution, des ondelettes et un algorithme de type CART. Nous illustrons l'adaptation 
anisotrope donnee par l'algorithme sur plusieurs types de fonctions et d'images qui pre- 
sentent des transitions rapides le long de lignes courbes. 
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Figure 10 - Triangulation produite par l'algorithme de raffinement hierarchique, et 
interpolation, pour une fonction ayant une variation brusque le long d'une courbe sinu- 
soidale. 



Nous faisons une analyse plus approfondie de la convergence de ralgorithme Chapitre 
[8j dans le cas m = 2 de 1' approximation lineaire par morceaux. Notre resultat princi- 
pal montre que lorsque / est C 2 et strictement convexe, pour un choix particulier de la 
fonction de decision fonde sur l'erreur d'interpolation L 1 , la suite (Tn)n>n de triangu- 
lations generee par cet algorithme satisfait l'estimation asymptotiquement optimale de 
convergence 



limsup N\\f - A TN f hp(n) < C\\ Vl det(d 2 j )\\\ L -r(n), 

ou ^ := 1 + - . L'observation clef qui mene a ce resultat est que lorsque / est un polynome 
quadratique convexe, la minimisation de la fonction de decision choisit V arete la plus 
longue dans la metrique associee a la partie homogene quadratique de ce polynome. Cette 
propriete permet de montrer que les triangles generes par ralgorithme tendent en majorite 
a adopter un rapport d'aspect optimal. Le bon comportement de ralgorithme est aussi 



observe pour des fonctions generates non-convexes, comme illustre Figure 10 Cependant 
prouver le resultat de convergence optimale ci-dessus reste un probleme ouvert dans ce 
cadre. 



Nous etudions Chapitre [9] des variantes de l'algorithme de raffinement hierarchique 
presente ci-dessus. Nous considerons en premier lieu une fonction de decision fondee sur 
l'erreur locale de projection L 2 , qui est particulierement bien adaptee a l'implementation 
numerique car elle peut etre evaluee en un temps machine reduit. 

Nous nous concentrons ensuite sur le comportement de l'algorithme lorsqu'il est appli- 
que a des fonctions cartoon. L'algorithme original ne satisfait pas la meilleure estimation 
possible de convergence pour de telles fonctions. Nous montrons que la vitesse optimale 
de convergence pourrait etre retablie en remplagant la bissection du point milieu d'une 
arete vers le sommet oppose par d'autres choix de decoupages geometriques. 

Finalement nous considerons une autre variante de l'algorithme, fondee sur des rec- 
tangles alignes avec les axes de coordonnees au lieu de triangles de direction arbitraire, 
dans l'esprit des partitions rectangulaires etudiees Chapitre 1. Cette simplification mene 
a un resultat qui garantit la meilleure estimation possible de convergence pour toutes les 
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fonctions C 1 , dans le cas d'approximations constantes par morceaux. 



Introduction (English Version) 



Although this may seem a paradox, all exact science is dominated by the 
idea of approximation. (Bertrand Russel, logician and Nobel prize) 

1 Anisotropic finite element approximation : why and 
how ? 

This thesis is devoted to the problem of approximating functions by piecewise poly- 
nomial finite elements over triangulations, and more general meshes. We are particularly 
concerned with the setting where the mesh is adaptively designed depending on the func- 
tion to be approximated. This mesh may therefore include elements of strongly varying 
size, aspect ratio and orientation. 

Approximation by piecewise polynomial functions is a procedure that occurs in nume- 
rous applications. In some of them such as terrain data simplification, surface or image 
compression, the function / to be approximated might be fully known. In other applica- 
tions such as denoising, statistical learning or in the finite element discretization of PDE's, 
it might be only partially known or fully unknown. In all these applications, one usually 
makes the distinction between uniform and adaptive approximation. In the uniform case, 
the domain of interest is decomposed into a partition where all elements have comparable 
shape and size, while these attributes are allowed to vary strongly in the adaptive case. 
The partition may therefore be adapted to the local properties of /, with the objective of 
optimizing the trade-off between accuracy and complexity of the approximation. 

From an approximation theoretic point of view, the trade-off between accuracy and 
complexity if usually tied to the smoothness properties of the function : typically one 
expects higher convergence rates for smoother functions. Functions arising in concrete ap- 
plications may however have inhomogeneous smoothness properties, in the sense that they 
exhibit areas of smoothness separated by localized discontinuities. Two typical instances 
displayed in Figure [I] and Figure [2] are (i) edges in functions representing images, and (ii) 
shock profiles in the solutions to non-linear hyperbolic PDE's. Numerical procedures for 
image processing, such as image denoising or compression, or for the simulation of PDE's 
greatly benefit from economical and faithful approximations of such functions. A piecewise 
polynomial approximation on a uniform partition of the domain is generally not sufficient 
for these purposes. A first step toward adaptivity is to vary the size of the elements for- 
ming the partition according to the local smoothness properties of the function. A very 
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Figure 1 - A piecewise smooth function (top left), Uniform partition of the domain (top 
right), Partition associated to the adaptive approximation based on the largest coefficients 
in the Haar wavelet basis (bottom left), Partition associated to an adaptive anisotropic 
finite element approximation (bottom right), (credit : G. Peyre) 



similar procedure, often used in image processing, consists in retaining the largest terms 
in a wavelet decomposition, such as displayed on Figure [l] (bottom left). A second step is 
to observe that a higher resolution is needed in the direction orthogonal to the curve of 
discontinuity than in the tangential direction, and to take advantage of this property by 
using an anisotropic partition of the domain. In the two dimensional cases, such partitions 
are typically built from triangles of high aspect ratio aligned with the discontinuities, as 
displayed Figure [I] (bottom right) and Figure [2j 

In the context of numerical PDE's, adaptivity also refers to the fact that the compu- 
tational mesh is not fixed in advance, but instead is dynamically updated based on the 
available information on the exact solution gained through the solution process. From a 
numerical point of view, such methods require more complex algorithms and more intri- 
cate data structures than their non-adaptive counterparts. From a theoretical point of 
view the analysis of these adaptive algorithms, when it is possible, is generally involved. 
As a matter of fact, the improvement brought by adaptivity in terms of convergence rate 
is rigorously established only for few systems of PDE's, and in the case of isotropic finite 
element meshes. We refer to the survey paper [77] for a complete overview on these aspects 
in the case of elliptic equations. We need to mention that these difficulties are exacerba- 
ted when anisotropic elements are used. Several numerical mesh generation software such 
94] , nevertheless successfully use anisotropic adaptativity for the numerical si- 
mulation of PDE's, as displayed for instance on Figure [2} From a numerical point of view, 
the improvement brought by these methods seems obvious compared to non-adaptive or 
adaptive isotropic methods. However, many aspects of the theoretical analysis of aniso- 
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Figure 2 - Airflow around a supersonic plane, computed using a three dimensional highly 
anisotropic mesh, (credit : F. Alauzet |20|.) 



tropic methods remain open questions. 

This thesis studies the problem of anisotropic mesh adaptation for the approximation 
of a known function, which may be regarded as a preliminary step for the analysis of 
anisotropic mesh adaptation for the numerical simulation of PDE's, but may also serve 
in other applications such as terrain data, surface and image processing. 

Given a triangulation T of a bounded polygonal domain Q C M 2 , and a fixed integer 
k > 1, we denote by Vfc(T) the space of finite elements of degree k on T. The space Vfc(T) 
consists of all functions which coincide on each triangle T 6 T with a polynomial of total 
degree k 

V k (T) := {g ; g\ T e P fc , T e T}. 

The dimension of Vfc(T) is of the order (D(k 2 #(T)). Given a function / : O — > R and a 
triangulation T of fi, the best approximation error of / in V&(T) is defined by 



er(f) 



x 



inf ||/ 



x- 



(1) 



The letter X indicates the norm or semi-norm in which the approximation error ||/— g\\x is 
measured. In this thesis we restrict our attention to the L p norm and the W 1,p semi-norm, 
where 1 < p < oo. They are defined as follows : 



\h\ 



and I h 



\Vh\ 



with the standard modification when p = oo. Note that when using the W 1,p semi-norm, 
we need to impose global continuity of g in the above definition of Vk{T). 

The best approximation g 6 Vfc(T) of / can be exactly computed in the case of 
the L 2 norm or the W 1,2 (or if 1 ) semi-norm : g is the orthogonal projection of / onto 
Vk(T) with respect to the scalar product associated to the norm of interest. In the case 
p 7^ 2 of non hilbertian norms the best approximation of / is generally hard to compute, 
but "satisfactory" approximations can be obtained by different methods. If the function 
is smooth (at least continuous), one may use the standard Lagrange interpolant, while 
for non-smooth functions a quasi-interpolant operator is preferred, see Chapter [6j More 
generally, if Pj- is any continuous projection operator from the space X to Vk(T), it is 
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easily seen that for any / el, one has 

\\f-Prfh<Ce r (f)x, (2) 

where C := 1 + ||Pr||x^x- The problem of approximating a function / on a given trian- 
gulation T, using finite elements of degree k, is thus solved in good part. 

In the context of adaptive approximation, the triangulation T of the domain Q is 
not fixed, but can be freely chosen depending on the function / (in contrast we always 
assume in this thesis that the integer k is fixed although arbitrary). This naturally raises 
the objective of characterizing and constructing an optimal mesh for a given function /. 
Given a norm X of interest and a function / to be approximated, we formulate the problem 
of optimal mesh adaptation, as minimizing the approximation error over all triangulations 
of prescribed cardinality. We therefore define the adaptive best approximation error by 

ejv(/)x := inf e T {f)x = inf inf \\f - g\\x- (3) 

#{T)<N #(T)<7V g&V k (T) 

In contrast to the procedure of best finite element approximation on a fixed mesh, adaptive 
and anisotropic approximation is not yet well understood. In particular (i) how does the 
optimal mesh depend on the function f and (ii) how does the optimal error e^(f)x decay 
as N grows! These problems are well understood is the isotropic setting, for which the 
optimization is restricted to triangulations for which all triangles satisfy a uniform shape 
constraint 

diam(T) 2 < C\T\ 

where diam(T) and |T| stand for the diameter and area of T respectively, and C > is 
a fixed constant. In the general setting of potentially anisotropic triangulations, they are 
open problems. 

Heuristically, the simplicity of ([I]) comparatively to ^ is due to the fact that the op- 
timization is posed on the linear space Vfc(T), and that a near best solution may therefore 
be obtained by applying to / a stable projection operator as expressed by (J2|. In contrast 
the optimization problem ^ is posed on the union of spaces Vk(T) for all triangulations 
T satisfying #(T) < N, which is certainly not a linear space. This problem is therefore 
an instance of nonlinear approximation. Other instances include best iV-terms approxi- 
mations in a dictionary of functions, or best approximation by rational function. We refer 



to 42 for a survey on nonlinear approximation. 



The purpose of this thesis is to better understand optimal mesh adaptation posed on 
the full class of potentially anisotropic triangulations. The four parts of the thesis are 
respectively devoted to the four questions below : 

I. How does the approximation error ejv(/)x behaves in the asymptotic regime when 
the number of triangles N grows to +oo, when / is a smooth function? In that 
context, we establish a mathematical characterization of the optimal mesh, as well 
as sharp estimates of e^{f)x by above and below in terms of N and quantities that 
depend nonlinearly on the derivatives of /. 
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II. Which classes of functions govern the rate of decay of CN{f)x as N grows, and are in 
that sense naturally tied to the problem of optimal mesh adaptation ? In particular, 
we have in mind the model of the so-called cartoon functions, which by definition are 
smooth except along a collection of smooth curves of discontinuity. This is a popular 
image model in the image processing community (see for instance Figure [I] for an 
instance of a cartoon image). We shall see that such a model naturally fits in a richer 
function class corresponding to a given rate of decay of eN(f)x- 

III. Could the optimization problem ^ posed on triangulations T of a given cardina- 
lity N, be replaced by an equivalent more tractable problem? Triangulations are 
indeed discrete combinatorial objects, described in terms of points and edges, which 
is not handy when solving optimization problems of the form (|3]). We study the cor- 
respondence between certain classes of triangulations and of riemanninan metrics 
which in contrast are continuous objects. This allows us to reformulate and to solve 
the original optimization problem as a more tractable problem posed on the set of 
riemannian metrics. 

IV. Is it possible to produce a near-optimal sequence of triangulations (Tn)n> with 
#(7jv) = N, using a hierarchical refinement procedure! The property of hierarchy 
guarantees the inclusion of the associated finite element spaces Vfc(7jv) C V^T/v+i)- 
It is required in applications such as progressive encoding or online data processing. 
We provide with a simple and explicit algorithm which gives a positive answer to 
this question under some conditions. 

2 State of the art 

Before detailing the content of the thesis, we give in this section a short overview of the 
state of the art on both problems of the study of ejv(/)x as ^ grows, and the construction 
of a near-optimal triangulation. For that purpose we need to introduce some notations. 
We assume in the following that / 6 C°(Q), where 11 C R 2 is a bounded polygonal 
domain and f2 denotes the closure of fl For each triangulation T of f2 we denote by 1^ 
the standard interpolation operator on Lagrange finite elements of degree k on T : on 
each triangle T G T the interpolation 1^- / is the unique element of which agrees with 
/ on the points of barycentric coordinates in {0, |, • • • , ^p 1 , 1}. In the case k = 1, these 
points are simply the three vertices of T. If the triangulation T is conforming (each edge 
of a triangle is either on the boundary of Q or coincides with the entire edge of another 
triangle), then 1^-/ is continuous. In order to be consistent with the rest of this thesis we 
define m := k + 1 > 2, and we thus have 

e r (/)L P <||/-I^- 1 /llLP(n) and e T (f) w i, P < || V/ - V V?~ 1 f\\ LP{ ny 

Any estimate on the interpolation error thus automatically yields an upper estimate on 
the best approximation error e-j-{f)x- Furthermore if / is sufficiently smooth and if T is 
sufficiently fine, then these quantities are generally comparable. 

One of the founding results of adaptive anisotropic finite element approximation deals 
with the case of piecewise linear elements (m = 2) with the error measured in the LP norm, 
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see [27] or Chapter |5J This result may be stated as follows : for any bounded polygonal 
domain O C f 2 , for any 1 < p < 00, and for any function / G C 2 (Q) there exists a 
sequence (Tn)n>n of triangulations of Q, satisfying #(7jv) < N, and such that 



limsupiV||/-4/|| LP(n) <C Vldet^ 2 /)! (4) 

/V^oo L T (f2) 

where the exponent r G (0, 00) is defined by 

1 1 

-:=! + -, 
r p 

and C is a universal constant (C is independent of p, f2 and /). We recall that the upper 
limit of a sequence (mat)at>at is defined by 

limsupwjv := lim sup u n , (5) 

and is in general strictly smaller than the supremum sup n>n un- It is still an open 
question to find an appropriate upper bound for sup 7V>iVo ||/ — 1^ /||x^p(fi) when optimally 
adapted anisotropic partitions are used. The result Q can be extended to the exponent 
p = 00, and to simplicial partitions of domains Q of higher dimension, but the meshes T/v 
may then not be conforming. 

The result rt4|) reveals that the accuracy of the approximation to / is governed by 



the quantity v/j det(c? 2 /)|, which depends non-linearly on the hessian matrix d 2 f. This 
non-linear dependency is heavily tied to the fact that we authorize triangles of poten- 
tially highly anisotropic shape. The estimate Q is obtained by producing sufficiently fine 
triangulations which combine the two following heuristical properties : 

a) Error equidistribution : the contribution ||/ — 1^ /||jy>m of each triangle T G T/v to the 
global interpolation error ||/ — I7- is approximately the same. This condition 
can be translated into a local constraint on the area of the triangles, which is dictated 
by the local behavior of / and in particular by det(d 2 f (z)) . 

b) Optimal shape of the triangles : the aspect ratio and the orientation of a triangle 
T G T/v is dictated by the ratio of eigenvalues and by the eigenvectors of the hessian 
matrix d 2 f(z) for z G T. 

The simplest method for producing a sequence (Tn)n>n of triangulations satisfying 
Q is to use a "local patching strategy" that may be intuitively described as follows. In a 
first step the domain Q is split into regions 1 < % < k, sufficiently small so that the 
hessian matrix d 2 f(z) varies little on each Qi around an average value Mj. In other words 
/ is well approximated by a quadratic polynomial on each fij. Each region f2j is then tiled 
using a uniform triangulation Tjy which cells have an area, aspect ratio and orientation 
based on Mj, and the triangulations of fli are glued together into a triangulation T/v 
of Q, at the price of a few additional triangles at the interfaces between the fij. 

This construction, which is illustrated on Figure [3] is sufficient for the purpose of 
proving the asymptotical result Q but not for practical applications since it only becomes 
efficient for a large number of triangles. The following approach, based on riemannian 
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metrics, is often preferred in applications. We assume for simplicity that the hessian 
matrix M(z) := d 2 f(z) is positive definite for each z G Q, and we define 

H(z) := A 2 (det M(z))"^M(z) (6) 

where A > is a constant which role is to control the resolution of the triangulation. Such 
a map H is called a riemannian metric, and continuously associates to each point z£(l 
a symmetric positive definite matrix H(z). Note that for each z G £1 the matrix H(z) 
defines an ellipse 

£ z : = {uGl 2 ; u T H(z)u < 1}. 

As illustrated on the left part of Figure [4], a metric encodes at each point z G Q an 
information of area, aspect ratio and orientation, under the form of a symmetric positive 
definite matrix H(z), or equivalently of an ellipse £ z . Several mesh generation algorithms, 



such as 93,94 are able to produce a triangulation adapted to a given metric z i-)- H(z) 
in the sense that for each z G Q the triangle T G T containing 2 has a shape "similar" 
to the ellipse £ z as illustrated on the right part of Figure |4j In mathematical terms, this 
means that one has for each triangle T and zeT, 

b T + c x £ z cTcb T + c 2 £ z , (7) 

where < c\ < C2 are fixed constants and hx is the barycenter of T, or equivalently 
that T is "close" to an equilateral triangle of unit area in the metric H(z). If T is such a 
triangulation and if A is sufficiently large, a heuristic argument (which will be recalled in 
Chapter [2]) shows that 



#Cn||/-I^/||znn) < C||V|det(d 2 /)ill^ ( n), 



where the constant C depends on C\ and c 2 . 

Varying the parameter A we obtain different triangulations 7a, of cardinality propor- 
tional to A 2 , which leads to the error estimate Q. The production of the mesh T from 
the metric H is not straightforward. In addition, there exists no rigorous proof that the 
similarity condition (J7|) is achieved for most mesh generation algorithms, to the notable 



exception of 66 and [15]. 
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Figure 4 - A metric and an adapted triangulation. (credit : J. Schoen |84|) 

The approximation result Q is proved to be optimal if we restrict our attention to tri- 
angulations which satisfy a technical condition defined as follows. We say that a sequence 
(J~n)n>n of triangulations of a polygonal domain Q C 1R 2 is admissible if #(7at) < A" 
and if 

sup I \/N max diam(T) J < oo. (8) 

N>N V t ^Tn J 

For any admissible sequence (Tn)n>n of triangulations of fl, one can establish the lower 
bound 

liminf AT||/ - 1^ /|U, ( o) > c y/\det(d?f)\ , (9) 

where the constant c > is universal (see also Theorem [2] in the outline of the thesis 
below). Furthermore the admissibility condition is not too restrictive : for each e > 
there exists an admissible sequence of triangulations (Tn)n>n which satisfies the upper 
estimate Q up to the additive constant e in the right hand side. 

Similar results to (§4§ and ([9]) can be developed for isotropic meshes, in which the 
triangles may vary in size but are constrained to be isotropic in the sense that the mea- 
sure of degeneracy p(T) := diam(T) 2 /|T| is uniformly bounded by a constant po, see for 
instance [35]. In that case the counterpart to Q has the form 

limsupATH/ - % f\\ LP(n) < C\\d 2 f\\ L r (nh (10) 

N- 



for the same value of r and similarly for the counterpart to (|9]), with with constants c 
and C that depend on the bound p on the measure of degeneracy, see for instance 35 



Therefore the nonlinear quantity a/] det(c? 2 /)| is replaced by the linear d 2 f in the L T norm, 



and these results are now very similar to those of best A^-term wavelet approximation 30 

In terms of the eigenvalues \i(z), \z(z) of the symmetric matrix d 2 f(z) we thus replace 
the geometric mean y/\\i(z)\2(z)\ by max{ | Xi(z) \ , | A2 (z) \ } which may be significantly lar- 
ger when these eigenvalue have different order of magnitude. This is typically the case if 
the approximated function / exhibits strongly anisotropic features, and therefore we may 
expect a significant improvement when using anisotropic meshes for such functions. 



The result Q gives a sharp account of the improvement that anisotropic triangulations 
can provide compared to isotropic triangulations, however in a rather restrictive setting 
which has motivated our work : 
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I. The original result only applies to piecewise linear interpolation error measured in 
the L p norm, while higher order finite elements and Sobolev W 1,p norms are equally 
relevant. In particular the W 1,2 (or H 1 ) norm appears very naturally in the context 
of second order elliptic problems. How should the elements be optimally adapted and 
how should the resulting error bound Q be modified in this more general context ? 

II. The approximated function / needs to be C 2 , while the most interesting applications 
of adaptive approximation involve non smooth and even discontinuous functions, 
such as those appearing in Figures [T] and [2j Can we define in some sense a quantity 
such as det(d 2 /)|||ir(Q) when / is a discontinuous function? 

III. In numerical applications, the riemannian metric z i— > H(z) is used as an intermediate 
tool for mesh generation, but this approach lacks a precise equivalence result between 
these continuous objects and the different classes of anisotropic triangulations. Under 
which conditions on the metric z H- H(z) can we associate a triangulation T which 
agrees with this metric in the sense of (J7|) ? 

IV. The already mentionned anisotropic mesh generation algorithms are not hierarchical, 
in the sense that better accuracy is not achieved by local refinement but by a global 
re-design of the mesh. Can we propose a hierarchical mesh refinement algorithm that 
meets the optimal approximation bound (|4]) ? 

We also need to mention two fundamental issues which are discussed in this thesis, but 
have remained open problems and will be the subject of future research. First the error 
estimate Q only gives asymptotical information, as the number of triangles N tends 
to oo, while an error estimate valid for all values of N is highly desirable. Second the 
extension of this result to functions defined on domains of dimension higher than two is 
hindered by a difficult problem of computational geometry : the production of anisotropic 
and conforming meshes in dimension 3 or higher. This problem is addressed by some 



numerical methods such as 94 but not solved from a theoretical point of view. 



3 Outline of the thesis 

This thesis is composed of four parts which attempt to solve four key issues, numbered 
I to IV, encountered in earlier results on adaptive and anisotropic finite element approxi- 
mation. These four parts are mostly self consistent and can be read independently. The 
chapters that constitute each part should preferably be read sequentially, to the exeption 
of chapter 1 which may be skipped by the reader who wishes to go more directly to the 
heart of the matter. Chapters 1, 2, 3, 4, 7, 8, as well as the third part of Chapter 9, come 



from the papers 9 , 72 , 73 , 37 , 34 , 36 and 35 , respectively. Notations have been 



unified for the purpose of the thesis. Chapters 5 and 6 are the object of future papers. 



Part I. Finite elements of arbitrary degree, and Sobolev norms 

In this part, we discuss generalizations of the asymptotic estimate Q to finite elements 
of arbitrary degree, and to the Sobolev W 1,p semi-norms as measure of error. Through this 
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Figure 5 - Anisotropic partition of a domain into rectangles. 

analysis, we are led to introduce and study some key concepts for optimal mesh adaptation. 

As a starter we consider in Chapter[T]partitions of a rectangular domain into rectangles 
aligned with the coordinate clXCS, clS illustrated on Figure |5j instead of triangles of arbitrary 
direction. Such partitions are relevant when the coordinate axes play a privileged role, 
in such way that the anisotropic features of the approximated function / are likely to 
be aligned with the coordinate axes. In this setting, we obtain an optimal asymptotic 
error estimate. Our results apply to piecewise polynomials of arbitrary degree, when the 
approximation error is measured in the LP norm, and in any dimension d > 1. Here, we 
do not consider W l,p norms, since the approximants are generally discontinuous. 

The biggest advantage of this setting is that the mathematical technicalities required 
for the construction of an anisotropic partition of a domain, as well as the error analy- 
sis, are simplified by the presence of preferred directions. We thus take advantage of this 
simple setting to introduce and study a key concept named the shape function, which go- 
verns the local approximation error after optimal adaptation of the elements to the local 
properties of the approximated function. The shape function is also defined and used for 
anisotropic triangulations which are the object of Chapter 2 and 3. We give below its 
precise definition in this setting. 

Chapter [2] is devoted triangular finite elements of arbitrary degree m — 1 for m > 2, 
with L p as error norm. In order to present our results we need to introduce some notations. 
We denote by H m the space of homogeneous polynomials of degree m : 

H m := Span{xV ; k + I = m}. 

A key ingredient of our approach is the shape function K m<p : H m — > R+, where 1 < p < oo 
is an exponent. This function is defined by an optimization of the LP local interpolation 
error among the collection of triangles of unit area and of all possible shapes : for all 
7T G H m , 

K m ^ir):= inf \\tt - I™" 1 vr|| LP(r) , (11) 

y |T| = 1 ' 

where I™ -1 is the local interpolation operator on T. Our main result is the following 
asymptotic error estimate. 
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Theorem 1. Let Q C M 2 be a bounded polygonal domain, let f G C m (Q) and let 1 < 
p < oo. There exists a sequence (7n)n>n , #(7/v) < N, of conforming triangulations of 
Q such that 



limsupiVT|| / -I--V||L^) 



< 



K, 



m,p 



oTf 



ml 



(12) 



L T (f2) 



where the exponent r G (0, oo) is defined by - :- 



m I 1 
2 p" 



In the error estimate (12), we identify the collection d m f(z) of derivatives of order 
m of / at the point z G Q to the corresponding homogeneous polynomial in the Taylor 
development of / at z. In mathematical form 



d m f{z) 



ml 



k+l=m 



d m f x k y l 
d k xd l y^ldl\' 



Theorem [T] extends the known result ^ to finite elements of arbitrary degree m — 1. 
The quality of the adaptive and anisotropic approximation of / is similarly determined 
by a non-linear quantity of its derivatives : the shape function K myP (d m f (z)) is the "gene- 
ralization" of the determinant yf\ det(d 2 /(;z))| appearing in Q to derivatives of arbitrary 
order. 

This remark raises the need for an in depth study of the shape function K m>p . For m = 2 
we have established that K 2 ^{'k) is proportional to the square root of the determinant of 
the quadratic polynomial n = ax 2 + 2bxy + cy 2 G H 2 : 



K 



2,p 



7T 



C2 



■P 



det 7r| 



C-2 



, P V\ 



ac 



b 2 \ 



where the constant c 2 . p > only depends on the sign of det tt. We thus recover the earlier 
result Q. For m = 3, we establish that the shape function K^^ is the fourth root of the 
discriminant of the homogenous cubic polynomial tt = ax 3 + 3bx 2 y + 3cxy 2 + dy 3 G H3 : 



K 



3,p 



IT 



C 3 



p -\/\ disc tt\ = c 3) p\/|4(ac — b 2 ){bd — c 2 ) — {ad — be) 2 \ 



where the constant c^ tP > only depends on the sign of disc7r. For larger values m > 4 
we have not obtained an explicit expression of the shape function K mtP , but an explicit 
quantity uniformly equivalent to this function. This equivalent has the following form : 
there exists a polynomial Q m (ao, • • • , a m ) of the m+1 variables ao, • • • ,a m and a constant 
C m > 1 such that for all tt = a x m + aix rn ~ 1 y+- ■ ■ a m y m G H m one has with r m := degQ m , 



(ao, 



< K m ^Tr) < C m 



ao, 



(13) 



The polynomial Q T 



is obtained using the theory of invariant polynomials developed by 
Hilbert in 61 . We also characterise the zeros of the shape function, and thus the possible 
cases of "super-convergence" : K m ^{Tr) = if and only if tt has a linear factor ax + by 
of multiplicity s > m/2, in other words the homogeneous polynomial tt is sufficiently 
degenerated. 



The proof of Theorem [T] is based on a "local patching strategy", a two scale mesh 
generation procedure which proceeds as follows : we consider a first "macro-triangulation" 
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TZ of the polygonal domain which is sufficiently fine in such way that the derivatives 
of order m of / vary little on each triangle R G TZ around an average value denoted by 
7Tr G H m . To each polynomial ttr, R G TZ, we then associate a triangle Tr which is a 
minimizer, or a near minimizer, of the optimization problem defining Km,pij^ r) ■ We then 
tile each R G TZ using the triangle Tr properly scaled and its symmetric with respect 
to the origin. Eventually, as illustrated on Figure [3] the triangulation Tn is obtained by 
gluing together the periodic tilings on each R G TZ, with a few additional triangles in 
order obtain a globally conforming mesh. 



The next theorem establishes that the asymptotic error estimate (12) is sharp, at least 



if we restrict our attention to admissible sequences of triangulations, a condition defined in 



(J8J). The approximation result (15) establishes furthermore that the admissibility condition 
is not too restrictive. 



Theorem 2. Let Q C 1R 2 be a bounded polygonal domain, let f G C m (Q) and let 1 < p < 
oo. Let (Tn)n>n , #(7yv) < N, be an admissible sequence of triangulations of ft. Then 



]imMN^\\f-l^f\\ LP{n) 



> 



K, 



rn.p 



d m f 



ml 



(14) 



L T {Vt) 



where ^ := y + Furthermore for any e > there exists an admissible sequence {T^)n>n 
of triangulations of CI, satisfying #(7jv) < N, and such that : 



limsup N 2 ||/ 

N^oo 



jm—l 



/II 



< 



K 



rn.p 



d m f 



ml 



(15) 



L T (Q) 



Chapter [3] is devoted to the counterparts of Theorems [T] and [2] when the interpolation 
error is measured in the Sobolev W 1,p semi-norms, 1 < p < oo. These estimates involve 



the counterpart L m)P 
for all 7T G H m 



— y 



of the shape function K mp which is defined as follows : 



L m ,p{^) 



inf 

m=i 



7T 



jm- 



7T 



\Lp(T)- 



We also provide some explicit equivalents of L m>p (7r), defined similarly to (13) by an 
algebraic expression in the coefficients of 7r G H m . Our results for the W 1,p semi-norm are 
thus extremely similar to the results obtained for the LP norm. 

The adaptation of the proofs however is not straightforward, due to the following new 
phenomenon : the presence of strongly obtuse triangles in a mesh (with one of their angles 
close to 7r) may cause the gradient of the interpolated function to oscillate, as illustrated 
in Figure [6j This phenomenon deteriorates the interpolation error in the W 1,p semi-norm, 
but not in the LP norm. These "flat" triangles therefore need to be carefully avoided. In 
summary, long and thin triangles may be desirable but they should not be strongly obtuse. 

Before turning to description of the rest of this thesis we recall to the reader that the 
three chapters composing Part [I] are self consistent and can be read independently. 
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Figure 6 - Interpolation of a bi-dimensional parabolic function on a mesh built of acute 
(left) or "flat", strongly obtuse, triangles (right). 

Part II. Anisotropic smoothness classes and image models 

In this part that consists of the sole Chapter 4, we discuss the extension of our ap- 
proximation results to non-smooth functions. 

There exists various ways of measuring the smoothness of functions on a domain 
Q C M 2 , generally through the definition of an appropriate smoothness space with an 
associated norm. Classical instances are Sobolev and Besov spaces. Such spaces are of 
common use when describing the regularity of solutions to partial differential equations. 
From a numerical perspective they are also useful in order to sharply characterize at which 
rate a function / may be approximated by simpler functions such as Fourier series, finite 
elements (on isotropic triangulations) , splines or wavelets. 

The result of adaptive anisotropic approximation (|4]) and its generalisation Theorem 
[T] establish that the quality of the approximation of a function / by finite elements on 
anisotropic triangulations is governed by a non-linear quantity of the derivatives of /, at 
least from an asymptotical point of view. In the case of finite elements of degree 1, and 
of the approximation in L 2 norm, the relevant quantity is the following 



The functional A strongly differs from the Sobolev, Holder or Besov norms, because of 
its nonlinear behavior : A does not satisfy the triangle inequality, or any quasi-triangle 
inequality. In other words for any constant C there exists f,g G C 2 (Q) such that 



The lack of a triangular inequality forbids to use the classical techniques of linear analysis 
to define a smoothness space attached to the functional A. The extension of the approxi- 
mation result Q to functions which are not C 2 is therefore not straightforward. 

Functions arising in concrete applications, for instance in image processing or in the 
solution of hyperbolic PDEs, often exhibit areas of smoothness separated by local dis- 
continuities. A mathematical model for this type of behavior is the collection of cartoon 
functions, which are smooth except for a jump discontinuity accross a collection of smooth 




A(f + g)>C(A(f)+A(g)). 



(16) 
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curves. A heuristical analysis presented in Chapter [4] suggests that for any cartoon func- 
tion / defined on a bounded polygonal domain Q, there exists a sequence (7n)n>n of 
anisotropic triangulations of Q, satisfying #(T/v) < N, and such that 



sup N\\f-TLf\\i? 



N>N 



'(n) 



< oo. 



(17) 



As illustrated on Figure [TJ the triangulations (Tn)n>n combine highly anisotropic tri- 
angles aligned with the discontinuities of /, and large triangles in the regions where / is 



smooth. The approximation result (17) raises the hope that a precise and quantitative 



asymptotic error estimate for the adaptive anisotropic approximation of cartoon functions 
extends the known result 

limsupAT||/ - 1^ /|| i2(n) < CA(f), (18) 

which holds if / is C 2 . 

So far we have only fulfilled one part of this program : the extension of the functional A 
to cartoon functions. More precisely, consider a radial and compactly supported function 
ip E C°°(1R 2 ) of unit integral. For any 5 > we define the mollifier (ps(z) := jpLp (|) and 
we denote by fs'—f* <fs the convolution of / with cpg. We establish in Chapter [| that if 
/ is a cartoon function, then A(fg) converges as 5 — > to an explicit expression : 



lim A(f s ) 2/3 



8^0 



V\det(d 2 f)\ 



2/3 

L 2 / 3 (Q\r) 



+ C(<p) 



/c 7 



2/3 

L 2 /3(r) 



y\det{d 2 f(z))\dz + C{ip) / |K(s)| 1/3 |-y(s)| 2 / 3 rfs. 



Here, we have denoted by T the collection of curves where the cartoon function / is dis- 
continuous, by 7(s) the jump of / at the point s G T, and by k(s) the curvature of T at 
s. The constant C(ip) is positive and explicit in terms of ip. 



The extension of the nonlinear functional A to cartoon functions puts in light a link, 
explored in ^^4, between anisotropic finite element approximation and several other ma- 
thematical fields. We think in particular of affine invariant image processing 22 and of 
the definition in 43 of smoothness spaces through the regularity of the level lines of a 
function. 



We also regard the quantity A as a "second order" counterpart to the total variation 
semi-norm TV, a measure of smoothness defined in terms of the first derivatives which 
is also finite for all cartoon functions. The total variation plays a central role in image 
processing and in the analysis of transport equations, two domains in which piecewise 
smooth functions naturally appear. For any cartoon function /, the total variation TV(/) 
is given by the following formula 



TV(/)= / |V/(*)|<fe+ / \l(s)\ds. 
Jn\r Jr 
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TV ~ A 2/3 > 1 TV ~ 1, A > 1 TV ~ 1, A > 1 

Figure 7 - Behavior of the functionals TV and A on different types of images. 



We compare the behavior of the two quantities A(f) 2 ^ 3 and TV(/) for different fami- 
lies of cartoon functions /. It appears that these quantities are equivalent when / is a 
smoothly oscillating function f(z) := cos(u;|z|) with large ui, see figure [7] (i). In contrast 
the discontinuities are penalized in a different manner by these two functionals. For a 
staircase-like function, as illustrated on Figure [7] (ii) , the total variation remains bounded 
while A tends to infinity as the number of steps grows, due contribution |7(s)| 2//3 of the 
jump term. Furthermore, due to the curvature term | s , A is much larger than TV 
for characteristic functions of sets with a complex or oscillating boundary as illustrated 
in Figure [7} 

The functional A can thus be thought of as a new quantitative image model : a mono- 
chromatic image, described by a brightness function / : [0, l] 2 — > [0, 1], is plausible if A(f) 
is sufficiently small. Generalising the work [70] we discuss an image recovery procedure 
using this model as a bayesian prior. At the present time, this algorithm is not satisfac- 
tory due to very high convergence time, and for this reason we present some numerical 
illustrations in a simplified one-dimensional setting. 

Eventually we discuss the extension to cartoon functions of other non-linear quantities 
appearing in anisotropic finite element approximation such as the norm \\K m , p (d m f) \\lt 
of the shape function for m > 2, or the counterpart to this quantitiy for functions / of 
more than two variables. 



Part III. Mesh generation and riemannian metrics 

Triangulations are discrete objects of combinatorial nature : they can be described by 
the collection of their vertices and of the edges between them. This description is fruit- 
ful for the demonstration of algebraic results such as the Euler formula, or for computer 
processing. In contrast, as explained earlier, many approaches towards anisotropic mesh 



adaptation [15,20,66 are based on a continuous equivalent object, namely a riemannian 
metric z i— > H(z), in other words a continuous function H from Q to the set S£ of sym- 
metric positive definite matrices. Once this metric has been properly designed, it is the 
task of the mesh generation algorithm to design a triangulation that agrees with this me- 
tric according to ([7]). The purpose of Part III is to formulate precise equivalence results 



between some classes of triangulations and of riemannian metrics. This equivalence trans- 
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lates some geometrical constraints satisfied by the triangulations into the form regularity 
properties of the equivalent riemannian metrics. 

In order to state our results we need to introduce some notations. We associate to each 
triangle T its barycenter zt G IR 2 and the symmetric positive definite matrix Ht G 
such that the ellipse £t defined by 

S T := {zeR 2 ; (z - z T ) T H T (z - z T )}, 

is the ellipse of minimal area containing T. The point zt thus encodes the position of T, 
while the matrix T-Lt G encodes the area, the aspect ratio and the orientation of the 
triangle T. 

We denote by T the collection of conforming triangulations of the infinite domain 
IR 2 . The choice to consider infinite triangulations is guided by simplicity, and further 
work will be devoted to triangulations of bounded polygonal domains. We denote by 
H := C°(IR 2 , 1S2 ) the collection of riemannian metrics on IR 2 . A metric H G H continuously 
associates to each point z G IR 2 a symmetric positive matrix H(z) G S 1 ^. For C > 1, we 
say that a triangulation T G T is C-equivalent to a metric H G H if for all T G T and 
for all z G T we have 

C~ 2 H(z) < H T < C 2 #(,2), 

where these inequalities are meant in the sense of symmetric positive matrices. We say 
that a class of triangulations T* C T is equivalent to a class of metrics H* C H if there 
exists a uniform constant C > 1 such that 

- For any triangulation T G T* there exists a metric if G H* such that T and H are 
C-equivalent. 

- For any metric H G H* there exists a triangulation T G T* such that T and H are 
C-equivalent. 

We consider three particularly relevant classes of triangulation of IR 2 

Tj,c C T ajC > C T 9>c - 

which have a minor dependency on a parameter C > 1. The class T 9i c of graded trian- 
gulations, is defined by the following condition which imposes a minimum of consistency 
among the shapes of neighbouring triangles. A triangulation T belongs to T g c if for all 
T,T' G T one has 

T n 7V => C~ 2 -H T < H T ' < C 2 ^ r . 

A triangulation T belongs to the class T i)C of isotropic triangulations if T is graded, 
T G T S) c, and if the elements of T are sufficiently close to the equilateral triangle, as 
expressed by the following condition : for all T G T 

IIT^t || llT^y 1 1| ^ C 2 . 

The class T a ,c of quasi-acute triangulations is defined by a slightly more technical condi- 
tion on the maximal angles of the triangles, see Chapter [5} As explained in the earlier 
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description of Chapter 3, avoiding flat angles is needed to guarantee the stability of the 
gradient when applying the interpolation operator. One of our key results is the refor- 
mulation of this condition under the form of a regularity assumption on the equivalent 
riemannian metric. From a practical point of view, this condition is unfortunately not 
guaranteed by existing anisotropic mesh generation software. The constraints satisfied by 
these classes of triangulations of IR 2 are illustrated by some bounded triangulations on 
Figure |8j 

The results of Chapter [5] establish that when C is sufficiently large, these three classes 
of triangulations are equivalent to three classes of metrics respectively 



H,CH a cH 



.</• 



that are defined by precise smoothness conditions on the function z \- > H(z). In par- 
ticular, the class Hj consists of those metrics H which are proportional to the identity 
H(z) = Id /s(z) 2 , and such that the proportionality factor s satisfies one of the following 
surprisingly equivalent properties 

• Euclidean Lipschitz Property : \s(z) — s(z')\ < \z — z'\ for all z, z' G IR 2 . 

• Riemannian Lipschitz Property : | In s(z) — In s(z') | < dn{z, z') for all z, ^Gl 2 . 

We recall that the riemannian distance cIh{z,z') measures the length, distorted by the 
metric H, of the smallest path joining z and z' . 

d H (z,z'):= inf / h'(t)\\ H (j(t))dt 

7(0)=* Jo 
7(1)=*' 



where ||w||a/ := \/u T Mu and where the infimum is taken among all paths 7 G C 1 ([0, 1], IR d ) 
joining z and z' . 

The two lipschitz properties presented above have natural extensions to general ani- 
sotropic riemannian metrics, which are not anymore equivalent. Considering H G H and 
defining S(z) := H(z)~z for all z G IR 2 , these Lipschitz properties are namely 

d+(H(z),H(z')) < \z-z'\ for all z,z'eR 2 , (19) 

where d + {H{z), H(z')) := \\S(z) - S{z')\\, and 

d x {H{z), H{z')) < d H (z, z') for all z, z' G M 2 , (20) 

where d x {H{z),H{z')) := \nmax{\\S(z)S(z')- 1 \\ : \\S(z , )S(z)- 1 \\}. Note that d + and d x 
are distances on S^- 

The main result of Chapter [5] characterizes the classes of metrics H 9 and H a (associated 
to the classes of graded triangulations T Si c and of quasi-acute triangulations T a ,c) in 
terms of the above Lipschitz properties. 

Theorem 3. If the constant C is sufficiently large, then the class T 9) c of graded trian- 



gulations is equivalent to the class H 9 of metrics satisfying (20), and the class T a ,c of 
quasi-acute triangulations is equivalent to the class H a of metrics satisfying simultaneously 
(M) and Hty. 
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Figure 8 - Generation of a layer of simplices of width 5 along the unit circle using 0(5 1 ) 
isotropic triangles (top left), 0(5~2 | In 5\) "quasi-acute" triangles (top right), 0(5~i) tri- 
angles satisfying the grading condition or not (bottom left and bottom right). 

We give in Chapter [6] some applications of these results in the contexts of approxima- 
tion theory and of constrained mesh generation. Let us describe the latter example. For 
any closed set £cK 2 and any mesh T G T we denote by Vj-(E) the neighborhood of E 
in the triangulation T, which is defined as follows 

V r (E):= |J T. 

TeT 

We say that a mesh T separates two disjoint closed sets X and Y if Vq-(X) fl Vj-(Y) = 0. 
The counterpart for metrics of this property is the following : a metric H G H separates 
X and Y if dn{x,y) > 1 for all x G X and y G Y. We show that these two properties 
are rigorously equivalent, and we use this reformulation to compute the smallest number 
of triangles (up to periodicity) required for the separation of some (periodic) sets using 
a (periodic) isotropic, quasi-acute or graded triangulation. As illustrated on Figure [8j 
imposing more constraints on the triangulation typically raises the number of triangles 
needed to achieve the same task. 

The rest of this chapter is devoted to the control of the finite element approximation 
error of a function / on a triangulation T, in LP norm or W 1,p semi-norm, by a quantity 
e if(f)p, e n(^/)p or e #(^ f)p attached to a metric H equivalent to T. This analysis puts 
in light, in the case of Sobolev norms, the role played by the angle or regularity conditions 
which define the elements of T a or H a respectively. Finally, we present a counterpart for 
metrics of the asymptotic approximation results developed for triangulations Chapters 2 
and 3. 

Part IV. Hierarchical refinement algorithms 

The last part of this thesis is devoted to the study an algorithm proposed by Cohen, 
Dyn and Hecht which produces hierarchical sequences {Tn) n>n of (non-conforming) ani- 
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Figure 9 - Refinement algorithm : selection of the triangle which maximizes the local 
approximation error (i, in dark), choice of a bisection among the three possibilities (ii,iii), 
iteration of these two steps (iv). 

sotropic triangulations adapted to a given function /. Given a triangulation T of a domain 
Q and a function / G L P (Q), this algorithm creates in one step a triangulation T' of Q, 
of cardinality #(T') = #(T) + 1, proceeding as follows : 

1. (Greedy triangle selection) A triangle T G T which has a maximal contribution to 
the error is selected 

T : = argmax ||/ - A T > f\\ L p(T>), 
T'eT 

where At '■ L P (T) — > P m -i is a projection operator, for instance the L 2 (T) ortho- 
gonal projection onto P m _i- 

2. (Decision of a bisection) An edge e G {a, b, c} of T is selected by minimizing a a 
given decision function e i-> e£r(e, /) among the three edges. The triangle is refined 
by joining the mid-point of this edge to the opposite vertex creating the sub-triangles 
Tg 1 and T e 2 . The new triangulation is thus 

T':=T-{T} + {TlX)- 

Starting from a triangulation 7o of cardinality N Q of the domain Q, the algorithm pro- 
duces step after step a sequence (Tn)n>n , see Figure [9j of triangulations "adapted" to 
a given function / G L P (Q). The properties of these triangulations strongly depend on 
the approximated function / and on the choice of the decision function c?r(e, /), which 
governs the creation of anisotropy. In contrast the projection operator At plays a rather 
minor role. A typical choice for the decision function e i— > d,T{e, f) is the local error after 
bisection of the edge e, namely the algorithm selects the bisection that most reduces the 
error. 

We present this algorithm in more detail in Chapter [7], and we establish its convergence 
in the sense that the (discontinuous) piecewise polynomial approximation Ar N f defined 
by 

A TN f(z) =A T (z), zeT, TeT N 

converges towards / in LP as iV — > +oo for any / G L p , under some assumptions on 
the decision function e h-> dr(e, /). We also discuss the possibility of using the multis- 
cale hierachical structure to define multiresolution approximation, wavelets and CART 
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Figure 10 - Triangulation produced by the hierarchical refinement algorithm, and in- 
terpolation, for a function exhibiting a sharp transition close to a sinuosidal curve. 



algorithms. We illustrate the anisotropic adaptation of the algorithm on several types of 
functions and images presenting sharp transitions along curved edges. 

In Chapter [8] we make a deeper convergence analysis of the algorithm in the case 
m = 2 of piecewise linear elements. Our main result states that when / is a strictly 
convex C 2 function, then for a particular choice of the decision function based on the 
L 1 local interpolation error, the sequence (Tn)n>n °f triangulations generated by this 
algorithm satisfies the optimal asymptotic convergence estimate 



lim sup N\\f 



f\\ 



LP{Q) 



< c 



det(<# 



where 



1 H — . The key observation leading to this result is that when / is a convex 



quadratic polynomial, the minimization of the decision function selects the longest edge 
in the metric associated to the homogeneous quadratic part of this polynomial. This is 
used to prove that the triangles generated by the algorithm tend in majority to adopt 
an optimal aspect ratio. This good behaviour of the algorithm is also observed on more 



general non-convex C 2 functions, as illustrated on Figure 10 However, proving the above 
optimal convergence bound remains an open problem in this general setting. 

Finally we study in Chapter [9] some variants of the hierarchical refinement algorithm 
presented above. We first consider a decision function based on the L 2 local projection 
error, which is particularly well suited to the numerical implementation since it can be 
evaluated in small computing time. 

We then focus on the behaviour of the algorithm when applied to cartoon functions. 
The original algorithm does not satisfy the best possible convergence estimate for such 
functions. We show that the optimal convergence rate may be recovered when replacing 
the bisection from the mid-point of the edge towards the opposite vertex by more general 
geometric splitting procedure. 

Eventually we consider another variant of the algorithm, which is based on rectangles 
aligned with the coordinate axes instead of triangles of arbitrary direction, in the spirit 
of the rectangular partitions studied in Chapter 1. This simplification leads to a result 
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which guarantees the best possible convergence estimate for all C 1 functions, in the case 
of piecewise constant approximations. 
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Part I 

Optimal mesh adaptation for finite 
elements of arbitrary order 
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partitions 



Contents 

1.1 Introduction 1471 

1.1.1 The projection operator 08] 

1.1.2 History 09] 

1.1.3 Polynomials and the shape function [50] 

1.1.4 Examples of projection operators [51] 

1.1.5 Main results [S3] 

1.2 Study of the shape function 1541 

1.2.1 General properties [55] 

1.2.2 Desirable properties of the projection operator [56] 

1.2.3 Explicit formulas [57] 

1.3 Proof of the approximation results 1611 

1.3.1 Proof of Theorem 1.1.4 (Lower bound) [62] 

1.3.2 Proof of the upper estimates [M] 

1.4 Appendix : Proof of Lemma 1.3.5 1681 



1.1 Introduction 

The purpose of this chapter is to study the adaptive anisotropic approximation of a 
function by interpolating splines denned over block partitions in R d . We use the word 
block as a synonym for "d-dimensional rectangle". Our analysis applies to an arbitrary 
projection operator in arbitrary dimension. We then apply the obtained estimates to 
several different interpolating schemes most commonly used in practice. 

Our approach is to introduce the "shape function" which reflects the interaction of 
approximation procedure with polynomials. Throughout the chapter we shall study the 
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asymptotic behavior of the approximation error and, whenever possible, the explicit form 
of the shape function which plays a major role in finding the constants in the formulae 
for exact asymptotics. 



1.1.1 The projection operator 

Let us first introduce the definitions that will be necessary to state the main problem 
and the results of this chapter. 

We consider a fixed integer d > 1 and we denote by x = (#!,••• , xj) the elements of 
W 1 . A block R is a subset of R. d of the form 

r= n m*] 

i<i<d 

where Oj < b i: for all 1 < % < d. For any block R C R d , by L P (R), 1 < p < oo, we denote 
the space of measurable functions / : R — > R for which the value 



|/(x)| p ^J , if l<p<oo, 

esssup{|/(x)| ; x G i?}, if p = oo. 



p = ||/||LP(fl) 




is finite. We also consider the space C°(R) of continuous functions on R equipped with 
the uniform norm || • We shall make a frequent use of the canonical block I d , where 

I is the interval 

1 1 

2' 2 



I 



Next we define the space V := C°(I d ) and the norm || • \\y := || • ||L°°(i d )- Throughout this 
chapter we consider a linear and bounded (hence, continuous) operator I : V — > V. This 
implies that there exists a constant C\ such that 

|| lu\\ v < Ci||u||v for all u e V. (1.1) 

We assume furthermore that I is a projector, which means that it satisfies 

IoI = I. (1.2) 

Let R be an arbitrary block. It is easy to show that there exists a unique x Q e M. d and a 
unique diagonal matrix D with positive diagonal coefficients such that the transformation 

(f)(x) := x + Dx satisfies (pil d ) = R. (1.3) 

The volume of R, denoted by |i?|, is equal to det(D). For any function / e C°(R) we then 
define 

l R f :=I(/ °0)o <j>-\ (1.4) 

Note that 

11/ - iRfhHR) = (det o - !(/ o 0)|| LP(Id) . (1.5) 
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A block partition 1Z of a block Rq is a finite collection of blocks such that their union 
covers Rq and which pairwise intersections have zero Lebesgue measure. If 7Z is a block 
partition of a block R and if / G C (i2 ), by G L°°(Rq) we denote the (possibly 
discontinuous) function which coincides with l R f on the interior of each block R G 1Z. 

Main Question. The purpose of this chapter is to understand the asymptotic beha- 
vior of the quantity 

11/ - l-R N f\\LP(Ro) 

for each given function f on Rq from some class of smoothness, where (7Zn)n>i is a 
sequence of block partitions of R that are optimally adapted to /. 

Note that the exact value of this error can be explicitly computed only in trivial cases. 
Therefore, the natural question is to study the asymptotic behavior of the shape function, 
i.e. the behavior of the error as the number of elements of the partition TZn tends to 
infinity. 



Most of our results hold with only assumptions (1.1) of continuity of the operator I, 



the projection axiom (1.2), and the definition of Ir given by (1.4). Our analysis therefore 
applies to various projection operators I, such as the L 2 -orthogonal projection on a space 



of polynomials, or spline interpolating schemes described in j: 1.1.4 



1.1.2 History 

The main problem formulated above is interesting for functions of arbitrary smoothness 
as well as for various classes of splines (for instance, for splines of higher order, interpola- 
ting splines, best approximating splines, etc.). In the univariate case general questions of 
this type have been investigated by many authors. The results are more or less complete 
and have numerous applications (see, for example, (69]). 

Fewer results are known in the multivariate case. Most of them are for the case of 
approximation by splines on triangulations (for review of existing results see, for instance 
|i|[T8j|35j[59] and Chapter [2J). However, in applications where preferred directions exist, 
box partitions are sometimes more convenient and efficient. 

The first result on the error of interpolation on rectangular partitions by bivariate 



splines linear in each variable (or bilinear) is due to D'Azevedo 38 who obtained local 
(on a single rectangle) error estimates. In [7] Babenko obtained the exact asymptotics 
for the error (in L 1 , L 2 , and L°° norms) of interpolation of C 2 (I d ) functions by bilinear 
splines. 

In [§] Babenko generalized the result to interpolation and quasiinterpolation of a 
function / G C 2 (I d ) with arbitrary but fixed throughout the domain signature (number 
of positive and negative second-order partial derivatives). However, the norm used to 
measure the error of approximation was uniform. 

In this chapter we use a different, more abstract, approach which allows us to obtain the 
exact asymptotics of the error in a more general framework which can be applied to many 
particular interpolation schemes by an appropriate choice of the interpolation operator. 
In general, the constant in the asymptotics is implicit. However, imposing additional 
assumptions on the interpolation operator allows us to compute the constant explicitly. 
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The chapter is organized as follows. Section 1.1.5 contains the statements of main 
approximation results. The closer study of the shape function, as well as its explicit 
formulas under some restrictions, can be found in Section 1.2 The proofs of the theorems 



about asymptotic behavior of the error are contained in Section 1.3 



1.1.3 Polynomials and the shape function 

In order to obtain the asymptotic error estimates we need to study the interaction of 
the projection operator I with polynomials. 

The notation a always refers to a d- vector of non-negative integers 

a = («!, • • • ,a d ) G TL%. 

For each a we define the following quantities 

let I := > ttj, a\ := I <%iU max(a) := max a,. 

^— ' J- J. Ki<d 

l<i<d l<i<d 

We also define the monomial 

x ° := n x ° i 



% i 

Ki<c 



where the variable is X = {X\, X d ) G IR d - Finally, for each integer k > we define the 
following three vector spaces of polynomials 



n 

ip** 



Vect{X Q 
Vect{X Q 
Vect{X Q 



\a\ < k}, 

max(a) < k and \a\ < k + 1}, (1.6) 
max(ct) < k}. 



Note that clearly dim(P£*) = (k + l) d . In addition, a classical combinatorial argument 
shows that 

. m fk + d\ fk + d+l\ 

dimF fe = I I and dimF fc = dimF fc+1 — a = I J — a. 

By V\ we denote the image of I, which is a subspace of V — C°(I d ). Since I is a 
projector (1.2), we have 

V4 = {I(/): /en = {/^: / = I(/)}- (L7) 

From this point on, the integer k is fixed and defined as follows 

k = k(T) := max{fc' > ; P fc / C ^} (1.8) 

Hence, the operator I reproduces polynomials of total degree less or equal than k. (If 
k = oo then we obtain, using the density of polynomials in V and the continuity of I, that 
I(/) = / for all / G V. We exclude this case from now on.) 
In what follows, by m we denote the integer defined by 

m = m(I) ■= k + 1, (1.9) 
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where k = k(l) is defined in ( 1.8 ). By H m we denote the space of homogeneous polynomials 
of degree m 

H m := Vect{X a ; |a| = m}. 
We now introduce a function Kj on H m , further referred to as the "shape function". 



Definition 1.1.1 (Shape Function). For all n G M m 

K^n) := inf [| vr - l R it\\lp<r), 

\R\=1 



;i.io) 



where the infimum is taken over all blocks R of unit d- dimensional volume. 

The shape function K plays a major role in our asymptotical error estimates developed 
in the next subsection. Hence, we dedicate §1.2| to its close study, and we provide its explicit 
form in various cases. Note that for any A > 0, since tt is homogeneous of degree m 



inf || vr - l R ir\\ L p(R) = A d+ vK I (-n). 
\R\=A " \ i 



;i.ii) 



The optimization (1.10) among blocks can be rephrased into an optimization among 
diagonal matrices. Indeed for any block R there exists a unique xo G M d and a unique 
diagonal matrix with positive coefficients such that R = <p(I d ) with 4>(x) = xq + Dx. 



Furthermore, the homogeneous component of degree m is the same in both ir o and 
7i o D, hence 7ro0 — 7roD g (recall that m = k + 1) and therefore this polynomial is 
reproduced by the projection operator I. Using the linearity of I, we obtain 

7T O — I(7T O (f)) = 71 O D — I(tT O D) . 



Combining this with (1.5), and observing that detD = \R\, we obtain that 



K i (tx)= inf || o D - 1(tt o D)\\ LP nd )} 

det D=l 



;i.i2) 



£»0 



where the infimum is taken over the set of diagonal matrices with non-negative entries 
and unit determinant. 



1.1.4 Examples of projection operators 

In this section we define several possible choices for the projection operator I which are 



consistent with (1.8) and, in our opinion, are most useful for practical purposes. However, 



many other possibilities could be considered. 

Definition 1.1.2 (L 2 (I d ) orthogonal projection). We may define 1(f) as the L 2 (I d 
thogonal projection of f onto one of the spaces of polynomials P^, kP* k or JP* k * defined in 



or- 



(1-6). 



If the projection operator I is chosen as in Definition 1.1.2 then a simple change 



of variables shows that for any block R, the operator I# defined by (1.4) is the L 2 (R) 
orthogonal projection onto the same space of polynomials. 
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To introduce several possible interpolation schemes for which we obtain the estimates 
using our approach, we consider a set Uk C I of cardinality #(£4) = k + 1 (special cases 
are given below). For any u = (m, ■ ■ ■ Ud) E U d we define an element of P* k * as follows 



mx) : = n 



Ki<d 



( \ 
-p-r Xj-V 

«ec/ fe 



G pr. 



Clearly, /x u (u) = /i u («i, • • • ,u d ) = 1 and/i u (v) = ^(^i, • • • , t> d ) = if v = (i^, • • • ,v d ) G 
C/| and v/u. 

It follows that the elements of B := (/x u ) u ec/ d are linearly independent. Since #(7>) = 
#([/|) = (Jfe + l) rf = dim(Pr), B is a basis of Pf- 

Therefore, any element of \i E P* k * can be written in the form 

fi(x) = Ki*u(x). 

ueu£ 

It follows that, for any given / G C°(I d ), there exists a unique element of \i E P* k * such 
that /i(u) = /(u) for all u G L^. We define 1/ := /i, namely 

(l/)(X) := ^/(uK(J)ePr. 

We may take Uk to be the set of k + 1 equi-spaced points on I 

17* = {-^ + | 5 0<n<A;J. (1.13) 

We obtain a different, but equally relevant, operator I by choosing Uk to be the set of 
Tchebychev points on I 



u h = |2 cos ItJ ; °- n - k j- (L14) 

Different interpolation procedures can be used to construct I. Another convenient inter- 
polation scheme is to take 

and I(/) = / on a subset of U d . This subset contains dimP£ points, which are convenient 
to choose first on the boundary of I d and then (if needed) at some interior lattice points. 
Note that since dimP£ < #(Z7^) = (k + l) d , it is always possible to construct such an 
operator. 

If the projection operator I is chosen as described above, then for any block R and 
any / G C°(R), Ir(/) is the unique element of respective space of polynomials which 
coincides with / at the image (p(p) of the points p mentioned in the definition of I, by the 
transformation described in ( 1.3| ). 
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1.1.5 Main results 

In order to obtain the approximation results we often impose a slight technical restric- 
tion (which can be removed, see for instance jij) on sequences of block partitions, which 
is defined as follows. 

Definition 1.1.3 (Admissibility). We say that a sequence {1Zn)n>i of block partitions of 
a block R Q is admissible if #(7£jv) — N for all N > 1, and 



sup I N d sup diam(_R) ) < oo 

N>1 V ReK N 



(1.15) 



We recall that the approximation error is measured in LP norm, where the exponent 
p is fixed and 1 < p < oo. We define r G (0, oo) by 



lml 

t dp 



;i.i6) 



In the following estimates we identified d m f(x) with an element of H m according to 



d m f{x) 



ml 



|o|=m 



d m f{x) X a 
dx a a\ 



(1.17) 



We now state the asymptotically sharp lower bound for the approximation error of a 
function / on an admissible sequence of block partitions. 

Theorem 1.1.4. Let R be a block and let f e C m (i?o)- For any admissible sequence of 
block partitions (R-n)n>i °f Rq 



limMN i\\f -I Un f\\ LP(Ro) > 



Kj 



d m f 



ml 



L-(R ) 



The next theorem provides an upper bound for the projection error of a function / 
when an optimal sequence of block partitions is used. It confirms the sharpness of the 
previous theorem. 

Theorem 1.1.5. Let R be a block and let f e C m (Ro). Then there exists a (perhaps 
non-admissible) sequence (R-n)n>i, #(^jv) — N, of block partitions of R satisfying 



limsupiVd ||/ - l nN /||z,p(h ) < 



K, 



d m f 



m\ 



;i.i8) 



Lt(Ro) 



Furthermore, for all e > there exists an admissible sequence (TZ £ n )n>i of block par- 
titions of Ro satisfying 



limsupiVd ||/ - Ik> fhp(R ) < 



Kj 



oTf 



ml 



+ £. 



;i.i9) 



L-r(Ro) 
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An important feature of these estimates is the "limsup". Recall that the upper limit 
of a sequence (itjv) n> No is defined by 

limsup Mat := lim sup u n , 

and is in general strictly smaller than the supremum sup n>n un- It is still an open 
question to find an appropriate upper estimate of sup N>N N~d\\f — Ik n /||lp(_r ) when 
optimally adapted block partitions are used. 

In order to have more control of the quality of approximation on various parts of the 
domain we introduce a positive weight function Q E C°(Rq). For 1 < p < oo and for any 
u G L P (R ) as usual we define 



M|i>(Ro,n) := ||ufi||.D>(.R ). 



(1.20) 



Remark 1.1.6. Theorems 1.1. 4\ \l-l-5\ and \1.1.7\ below also hold when the norm \\ ■ \\lp(Ro) 
(resp || ■ \\l t {r )) is replaced with the weighted norm \\ • ||lp(_r ,^) ( r ^ s V \\ • \\l t (Ro&)) defined 



in (1.20) 



In the following section we shall use some restrictive hypotheses on the interpolation 
operator in order to obtain an explicit formula for the shape function. In particular, 
Propositions 1.2.7, 1.2.8| and equation (1.21) show that, under some assumptions, there 
exists a constant C — C(I) > such that 



1 u fd m f 



< 



\ 



n 

Ki<d 



d m f 
dxT 



< CKi 



aTf 



ml 



These restrictive hypotheses also allow to improve slightly the estimate (1.19) as follows 



Theorem 1.1.7. If the hypotheses of Proposition fTjT? or 1.2.8 hold, and if Kj (^f) > 
everywhere on R , then there exists an admissible sequence of block partitions {TZn)n>i 



of Rq which satisfies the optimal estimate (1.18). 



The proofs of the Theorems 1.1.4 1.1.5 and 1.1.7 are given in £1.3 Each of these 



proofs can be adapted to weighted norms, hence establishing Remark |1.1.6| Some details 
on how to adapt proofs for the case of weighted norms are provided at the end of each 
proof. 



1.2 Study of the shape function 



In this section we perform a close study of the shape function Kj, since it plays a 

we investigate 



major role in our asymptotic error estimates. In the first subsection £1.2.1 



general properties which are valid for any continuous projection operator I. However, we 
are not able to obtain an explicit form of Kj under such general assumptions. Recall that 
in §1.1.4 we presented several possible choices of projection operators I that seem more 
likely to be used in practice. In §1.2.2 we identify four important properties shared by 
these examples. These properties are used in §1.2.3 to obtain an explicit form of Kj. 
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1.2.1 General properties 

The shape function K obeys the following important invariance property with respect 
to diagonal changes of coordinates. 

Proposition 1.2.1. For all ir G M m and all diagonal matrices D with non-negative 
coefficients 

Kj{iroD) = {det D^K^n). 

Proof: We first assume that the diagonal matrix D has positive diagonal coefficients. Let 
D be a diagonal matrix with positive diagonal coefficient and which satisfies detD = 1. 
Let also 7r G H m . Then, since it is homogeneous and of degree m 

no(DD) = vr o ((det D)*D) = (det D)^n o £>, 

where D := (det D)^^DD satisfies det-D = det D = 1 and is uniquely determined by D. 



According to (1.12) we therefore have 



Ki(icoD) = inf \\n o (DD) o (DD))\\ LPm 

detL>=l 

= (detD)^ inf \\ir o D — l(it o D)\\ LP ad) 

det D=l 

= (det D)^K j (tt), 

which concludes the proof in the case where D has positive diagonal coefficients. 
Let us now assume that D is a diagonal matrix with non-negative diagonal coefficients and 
such that det(-D) = 0. Clearly there exists a sequence (D n ) n > of diagonal matrices with 
positive coefficients, such that det-D n = 1 for all n > and that DD„ — > as n — > oo. 
Therefore 7r o (DD n ) — > 0, which implies that K(tt o D) = and concludes the proof of 
this proposition. o 

The next proposition shows that the exponent p used for measuring the approximation 
error plays a rather minor role. By K p we denote the shape function associated with the 
exponent p. 

Proposition 1.2.2. There exists a constant c > such that for all 1 < p\ < p<i < oo we 

have on JH m 

cK < K < K < K 

Proof: For any function / G V = C°(I d ) and for any 1 < p\ < p 2 < oo by a standard 
convexity argument we obtain that 



L!(I d ) — ll/IU p i(I d ) — 11/ lll,P2(I<J) < ||/||L°°(nd)- 



Using (1.12), it follows that 

K x < K P1 < K P2 < 
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on H m . Furthermore, the following semi norms on H r 



Mi := ||tt — l7r||£ifl[d) and \tt\ 



I 7T — I 7T I 



L°°(I d ) 



vanish precisely on the same subspace of H m , namely V\ fl H m = {n G H m ; n = In}. 
Since H m has finite dimension, it follows that they are equivalent. Hence, there exists a 
constant c > such that c| • ^ < | • |i on H m . Using (1.12), it follows that cK^ < Ki, 
which concludes the proof. o 



1.2.2 Desirable properties of the projection operator 

The examples of projection operators presented in §1.1. 4| share some important pro- 
perties which allow to obtain the explicit expression of the shape function Kj. These 
properties are defined below and called H±, H a , H* or H^. They are satisfied when the 



operator I is the interpolation at equispaced points (Definition 1.13), at Tchebychev points 



(Definition 1.14), and usually on the most interesting sets of other points. They are also 
satisfied when I is the L 2 (I d ) orthogonal projection onto JP* k or JP* k * (Definition 1.1.2). 



The first property reflects the fact that a coordinate Xi on I d can be changed to — Xi, 
independently of the projection process. 

Definition 1.2.3 (H± hypothesis). We say that the interpolation operator I satisfies the 
H± hypothesis if for any diagonal matrix D with entries in ±1 we have for all f G V 

l{foD)=l{f)oD. 

The next property implies that the different coordinates X\, ■ ■ ■ , Xd on I d play symme- 
trical roles with respect to the projection operator. 

Definition 1.2.4 (H a hypothesis). If M a is a permutation matrix, i.e. (M a )ij := b~i a (j) 
for some permutation o of {1, ■ ■ • , d}, then for all f 6 V 

l(foM a )=l(f)oM a . 



According to (1.8), the projection operator I reproduces the space of polynomials P^. 
However, in many situations the space V\ of functions reproduced by I is larger than P^. 
In particular V\ = JP* k * when I is the interpolation on equispaced or Tchebychev points, 
and V\ = Pfc (resp P£, P") when I is the L 2 (I d ) orthogonal projection onto P& (resp P£, 

PD. 

It is particularly useful to know whether the projection operator I reproduces the 
elements of P* k , and we therefore give a name to this property. Note that it clearly does 
not hold for the L 2 (I d ) orthogonal projection onto P&. 

Definition 1.2.5 (H* hypothesis). The following inclusion holds : 

Pi C Vl 

On the contrary it is useful to know that some polynomials, and in particular pure 
powers x™, are not reproduced by I. 
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Definition 1.2.6 (if** hypothesis). 

If ^2\x?eVi then (Ai,--- ,A d ) = ((),-•• ,0). 



Ki<d 



This condition obviously holds if I(/) G P£* (polynomials of degree < k in each 
variable) for all /. Hence, it holds for all the examples of projection operators given in 



the previous subsection { 1.1.4 



1.2.3 Explicit formulas 

In this section we provide the explicit expression of the shape function K when some 
of the hypotheses H±, H a , or ii** hold. Let it G H m and let A, be the coefficient of 
X m in 7r, for all 1 < % < d. We define 



Ki<d 



and 



s(tt) := #{1 <i<d; Xi>0}. 



If 



d m f(x) 



is identified by (1.17) to an element of H m , then one has 



iC 



d m f{x) 



m 



\ 



Ki<d 1 



Proposition 1.2.7. If m is odd and if H±, H a and hold, then 

K p (ir)=C(p)K.(ir), 

where 



C{p) :- 



> 0. 



LP(I d ) 



l<i<d \l<i<d 

Proposition 1.2.8. If m is even and if H a , if* and if** hold then 
Furthermore, 



C(p,0) = C(p,d) 



l<i<d \l<i<d J 



> 0. 



LP(l d ) 



Other constants C(p, s) are positive and obey C(p, s) = C(p,d — s) . 



;i.2i) 



;i.22) 



Next we turn to the proofs of Propositions 1.2.7 and 1.2.8 
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Proof of Proposition 1.2.7 Let ir E H. m and let A; be the coefficient of X™ in n. 
Denote by 

E w 



7T* 



Ki<d 



so that 7r — 7r* G P£ and, more generally, it o D — tt»o D g P£ for any diagonal matrix .D. 
The hypothesis if* states that the projection operator I reproduces the elements of P* k , 
and therefore 

7T o D - I(tt o D) = 7T* o D - Ktt, o D). 



Hence, Ki(tt) = Kjfa) according to (1.12). If there exists io, 1 < iq < rf, such that 



Aj = 0, then we denote by -D the diagonal matrix of entries Du — 1 if % ^ and if 
i = Iq. Applying Proposition |1.2. 1" we find 

^(tt) = Kj{^) = KfaoD) = (det D^Kfa) = 0. 

which concludes the proof. We now assume that all the coefficients Aj, 1 < i < d, are 



different from 0, and we denote by E{ be the sign of Aj. Applying Proposition 1.2.1 to the 
diagonal matrix D of entries | Aj | m we find that 

Kj{tt) = KiM = (det DpK^ o D~ l ) = K*(tt) Kj f £ ) . 

\l<i<d / 

Using the H± hypothesis with the diagonal matrix D of entries Du = £j, and recalling 
that m is odd, we find that 

k i ( E £ * x n = ^ ( E x * 

\l<i<d / \l<i<d 



We now define the functions 



# := X? - l(X™) for 1 < i < d. 



It follows from (1.12) that 



Kj ( e *r ) 

\l<i<d / 



inf 

IIl<i<dOi=l 



E 



Ki<d 



LP(I d ) 



where the infimum is taken over all d- vectors of positive reals of product 1. Let us consider 
such a d- vector (ai, • • • , a^), and a permutation a of the set {1, • • • , d}. The Her hypothesis 
implies that the quantity 



E a °(i)9i 



Ki<d 



LP(I d ) 
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is independent of a. Hence, summing over all permutations, we obtain using the triangle 
inequality 



Y a i& 



Ki<d 



LP(I d ) 



> 



1 

d~\ 

1 

d 



\Y Y a ^ 

l<i<d 

Y [Y a ^))9 

<i<d \ a / 



LP(I d ) 



[1.23) 



Y 3i 



Ki<d 



LP(l d ) 



Y ai - 

LP(Jd) l<i<d 



The right-hand side is minimal when a± 



ad = 1, which shows that 



Y aigi 


> 


Y 9i 


= C(p) 


l<i<d 


LP{l d ) 


l<i<d 


LP(I d ) 



with equality when a; = 1 for all i. Note as a corollary that 

Kfa) = ||7T £ - l(n e )\\ LP(Id) = C(p) where vr e = ]T ^Xf- (1-24) 



Ki<d 



It remains to prove that C(p) > 0. Using the hypothesis H±, we find that for all /i« G {±1} 
we have 



Y Vi9i 

l<i<d 

In particular, for any 1 < iq < d one has 



C(p). 



LP (1^) 



2\\9i 1 LP(I d ) < 


Y 9i 




+ 


2 9io ~ Y 9i 




l<i<d 


LP (I 


d ) 


\<i<d 



< 2C(p). 

LP(l d ) 

If C(p) = 0, it follows that g io = and therefore that X™ = I(X™), for any 1 < i < d. 
Using the assumption H*, we find that the projection operator I reproduces all the poly- 



nomials of degree m — k+ 1, which contradicts the definition (1.8) of the integer k. o 



Proof of proposition 1.2.8 We define Aj, 7r* and £j G {±1} as before and we find, 
using similar reasoning, that 



For 1 < s < d we define 



K^tt) = K^)Kr ( Zi*? ) ■ 

\l<i<d / 

c(p,s)-.= K I (j2xr- y x n 

\l<i<s s+l<i<d / 
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From the hypothesis H a it follows that Kj(n) = K*(tt)C(p, s(tt)). 

Using again H a and the fact that Kj(it) = Kj(—7r) for all tc G H m , we find that 

c(p, S ) = Kj(j2 x ™- E x n 

\l<i<s s+l<i<d J 

xr- E x i 



K 1 ^ 

a<i<'/- 
C(p,d — s). 



s+l<i<d 



We define Qi := X™ — I(X™), as in the proof of Proposition 1.2.7. We obtain the 



;ion 


1.2.7 


(1.23 


) 



expression for C(p, 0) by summing over all permutations as in (1.23) 

C(p,0) = 



E 9i 

Ki<d 



LP(I d ) 



This concludes the proof of the first part of Proposition 1.2.8 We now prove that C(p, s) > 
for all 1 < p < oo and all s £ {0, • • • , d}. To this end we define the following quantity 



on 



\a\\K 



E a ^< 

Ki<d 



LP(I d ) 



Note that \\cl\\k = if and only if 

^*r= E 



Ki<d 



Ki<d 



and the hypothesis if** precisely states that this equality occurs if and only if a,i = 0, for 
all 1 < % < d. Hence, II • ||# is a norm on M d . Furthermore, let 



a e Mi x 



n n = 1 



Then 



C(p, s) = inf ||a||x- 



Since £7 S is a closed subset of lR d , which does not contain the origin, this infimum is 
attained. It follows that C(p, s) > 0, and that there exists a rectangle R e of unit volume 
such that 



Ki(TTe) = Ike-I^IU^) = C(p,s(n £ )) where vr e = ^ ^Xf, (1.25) 
which concludes the proof of this proposition. 
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1.3 Proof of the approximation results 

In this section, let the block R , the integer m, the function / G C m (Ro) and the 
exponent p be fixed. We conduct our proofs for 1 < p < oo and provide comments on how 
to adjust our arguments for the case p = oo. 

For each i 6 K by ^ e P m we denote the m-th degree Taylor polynomial of / at 
the point x 

, X = , X {X):= £ ^(z)^T^ 

|a|<m 

and we define ir x G H m to be the homogeneous component of degree m in fi x , 

om £ yet 

|a|=m 

Since ^ and /i^ are polynomials of degree m, their m-th derivative is constant, and clearly 
d m TT x = d m n x = d m f(x). In particular, for any x G Rq the polynomial \x x — tt x belongs to 
Pfc (recall that k — m — 1) and is therefore reproduced by the projection operator I. It 
follows that for any x <E R and any block R 

n x ~l R 7i x = jj x -l R fx x . (1.28) 

In addition, we introduce a measure p of the degeneracy of a block R 

diam( J R) d 



p(R) 



\R\ 



Given any function g G C m (R) and any x G R we can define, similarly to (1.27), a 
polynomial n x G H m associated to g at x. We then define 

||d m fl r IU«( J R) : = SUp SUp |7Ta.(u)| . (1.29) 
x&R \\u\=l J 

Proposition 1.3.1. There exists a constant C = C(m,d) > such that for any block R 
and any function g G C m (R) 

\\g - l R g\\ L p{ R ) <C\R\^ p{R)^\\d m g\\ L °° {R) . (1.30) 

Proof: Let x G R and let g be the Taylor polynomial for g of degree m — 1 at point x 
which is defined as follows 

9o{X): - ^ ~dx~- a\ • 

|a|<m— 1 

Let x G R and let x(t) = x + t(x — x ). We have 

f 1 (1 - t) m 

g(%) = 9o(x) + I d m g xi t){x - x ) — — } — dt. 



t=o 



ml 
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Hence, 



\g(x)-g (x)\ ^ 
< 



\d m g\\L°°(R)\x - x \ 



ty 



-dt 



t=o. 



1 



\d m g\\L^(R) diam(i?) 



mi 

m 



(m + 1)! 

Since g is a polynomial of degree at most m — 1, we have go = I go- Hence, 

i 

\\g - Ir9\\lp{r) < \R\ p \\g - lRg\\L°°(R) 

= \R\*\\{g-9o) -Ir(9-9o)\\l°°{r) 
< (l + Ci)\R\p\\g- g \\L°°(R), 



'1.311 



where C\ is the operator norm of I : V — > V. Combining this estimate with (1.31), we 
obtain (1.30) which concludes the proof. 



1.3.1 Proof of Theorem 1.1.4 (Lower bound) 



The following lemma allows us to bound the interpolation error of / on the block R 
from below. 

Lemma 1.3.2. For any block R C Ro and x G R we have 

||/-I*/||lp(B) > \R\* {Kj(n x )-uj(dmmR)p(R)^), 

where the function uj is positive, depends only on f and m, and satisfies uj(5) — > as 
5^0. 



Proof: Let h := / — fj, x , where \i x is defined in (1.26). Using (1.28) and (1.11), we obtain 



||/ — lRf\\LP(R) > \\tTx — IrKx\\lp(R) — \\h — lRh\\LP(R) 

> \R\^ Ki(tt x ) - \\h - l R h\\ L P(R), 



and according to Proposition |1.3.1| we have 

\\h-\ R h\\ LP{R) <Co\R\^p{R)^\\d m h\\ Lao[R) . 

Observe that 

\\d m h\\ L ~ {R) = \\d m f - d m -K x \\ L ^ R) = \\d m f - d m f(x)\\ L ~ {R) . 
We introduce the modulus of continuity of the m-th derivatives of /. 



w*(r) := sup 

xi,x 2 £Ro: 
\xi—X2\<r 



\d m f{x 1 )-d m f{x 2 )\\ 



SUp I SUp \lt Xl (U) - 7T X2 (u)\ 

xi^eRo- \M<i 

\xi—x 2 \<r 



1.32) 



By setting u = Cq we conclude the proof of this lemma. 
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We now consider an admissible sequence of block partitions (7Zn)n>o- For all N > 0, 
R eIZn and x G int(-R), we define 

4>n(x) '■= \R\ and i^n{x) '■= (Kii^x) — w(diam(i?))p(i?)^) + , 

where A + := max{A,0}. We now apply Holder's inequality 



Ho 



/l/2<||/l|| II/2II 



with the functions 



I II T 

d 



fi = <Pn Vn and h = 0iv 
and the exponents pi — - and P2 = Note that i + 1 



Pi P2 \P d 



1. Hence, 



mp ^ p 

Vn < [ I 4>JV N 

Ro \JRo 



Ro 



1.33) 



Note that j Rn cf)^ = #(1Zn) < N. Furthermore, if R G TZn and x G int(-R) then according 
to Lemma [1.3.21 



(j) N {x)™4) N {x) = \R\- pip N (x)<\R\ p\\f -Ir/Wlp 



(«)■ 



Hence, 



mp 



URo 



< 



p 

Lp(R) 



11/ - I.R.f||z>(iio)- 



1.34) 



Inequality (1.33) therefore leads to 



1.35) 



Since the sequence (7?.Ar)jv>o is admissible, there exists a constant Ca > such that for 
all N and all R G TZn we have diam(i?) < CaN~~s. We introduce a subset of 7^ C 7Zn 
which collects the most degenerate blocks 

K' N = {ReK N - p{R) > uj(C a N-^}, 



where u is the function defined in Lemma 1.3.2 By R' N we denote the portion of Rq 



covered by 1Z' N . For all x e R \ R' N we obtain 

M*) > Kj^-ujiCAN--*) 1 --*. 
We define En '■= (^{CaN^^) 1 ^^ and we notice that En — > as — > 00. Hence, 

\l-(R ) ^ II ( K iM ~ £n) + \\ T lt{R(AR , n) 

IT 

\l^(R ) 



> ii(^(^)-^) + n; r ^-ci^i, 
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where C := max xe R Ki(tt x ). The last expression involves a slight abuse of notations, since 
(Ki(tt x ) — £n)+ stands for the function i6i? 4 (Ki(ir x ) — £jv)+ G K. Next we observe 
that \R' N \ — > as N — > +oo : indeed for all R G 1Z' N we have 

\R\ = diam( J R) d p( J R)- 1 < 0^-^(0 A N'^ . 

Since #(1Z' N ) < N, we obtain \R' N \ < C d A oj[C '^iV - ^)™ , and the right-hand side tends to 
as N — > oo. We thus obtain 



lim inf 

AM-oo 



l t (Ro) > Jim IICK/^) -e N ] 



7V-s>oo 



+ \\Lt(Ro) 



\Ki{^x)\\l-(r )- 



Combining this result with (1.35), we conclude the proof of the announced estimate. 



Note that this proof also works with the exponent p = 00 by changing 



mp \ p 



Ro 



into 



in (1.33) and performing the standard modification in (1.34). 



Remark 1.3.3. As announced in Remark 1.1. 6\ this proof can be adapted to the weighted 
norm \\ ■ \\lp(Ro,ci) associated to a positive weight function Q G C°(Rq) and defined in 



(1.20). For that purpose let r N := sup{diam(_R) ; R G TZn} and let 

VIn{x) := inf Q(x'). 

x'&R 
\x—x'\<risr 

The sequence of functions Qn increases with N and tends uniformly to Q as N — )■ 00. If 
R G TZn and x G R, then 

11/ - l R fh P{ R,a) > tt N (x)\\f - l R f\\ LP[R) . 

The main change in the proof is that the function ip^ should be replaced with ip' N := Vt^ip^. 
Other details are left to the reader. 



1.3.2 Proof of the upper estimates 



The proof of Theorems 1.1.5 (and 1.1.7) is based on the actual construction of an 



asymptotically optimal sequence of block partitions. To that end we introduce the notion 
of a local block specification. 

Definition 1.3.4. (local block specification) A local block specification on a block Ro 
is a (possibly discontinuous) map x 1— > R(x) which associates to each point x G Rq a block 
R(x), and such that 

- The volume \R{x)\ is a positive continuous function of the variable x G Rq. 

- The diameter is bounded : sup{diam(_R(x)) ; x G Rq\ < 00. 
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The following lemma shows that it is possible to build sequences of block partitions of 
Ro adapted in a certain sense to a given local block specification. 

Lemma 1.3.5. Let R be a block in iR d and let x h-> R(x) be a local block specification 
on R . Then there exists a sequence (V n ) n >i of block partitions of Ro, V n = V\ U V%, 
satisfying the following properties. 

- (The number of blocks in V n is asymptotically controlled) 



i im #( p »> 



[ \R(x)\- l dx. (1.36) 

JRo 



(The elements of V\ follow the block specifications) For each R £ V\ there exists 
y £ Ro such that 

R is a translate of n~ 2 R(y), and \x — y\ < ^-^ — i^9_ j or a \i x £ R. (1-37) 



n 



- (The elements ofV 2 have a small diameter) 



lim n 2 sup diam(i?) I = 0. (1.38) 

n ~ Kx) \ Revl ) 

Proof: See Appendix. o 

We recall that the block R , the exponent p and the function / £ C m (Ro) are fixed, 



and that at each point x £ Ro the polynomial n x £ H m is defined by (1.27). The sequence 
of block partitions described in the previous lemma is now used to obtain an asymptotical 
error estimate. 

Lemma 1.3.6. Let x i— > R(x) be a local block specification such that for all x £ Rq 

Iks - ^R(x){^x)\\lp{R{x)) < 1- (1.39) 



Let (V n ) n >i be a sequence of block partitions satisfying the properties of Lemma 1.3.5, and 
let for all N > 

n(N) := max{n > 1 ; #(P n ) < N}. 
Then IZ^ '■= V n [N) is an admissible sequence of block partitions and 

limsupJVt||/-I^/|| £p(flo) < ( / Rix^dxY . (1.40) 



Proof: Let n > an d let R £ V„, If R £ V x n then let y £ R be as in fll.37[ ). Using 
Proposition 1.3.1 and ( |1.11[ ) we find 



\\f -^Rf\\LP(R) < hy ~ Ifl7r v ||xP(H) + \\{f ~ ^y) ~ Ir{I ~ ^ V )\\lv{R) 

< rT^WiTy - l R ( y ) -Ky\\Lv(R( y )) + C\R\* diam(i?) m || d m f - d m 7r y \\ L oc(R) 

2d „ 2d , 



< n- + Cn--\R(y)\- P dmm(R(y)) m \\d m f-d m f(y)\\ L ~ {R) 

< n~^(l + CV*(^ 1 diam( J R ))), 
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where we defined C := C sup gM \R(y)\p diam(i?(y)) m , which is finite by Definition 



1.3.4 



We denoted by the modulus of continuity of the m-th derivatives of / which is definec 



at (1.32). We now define for all n > 1, 

S n := n 2 sup diam(i?). 



Rev* 



According to ( |l.38[ ) one has 5 n — > as n — > oo. If R £ V 2 , then diam(i?) < n 5 n and 

l 

v 



therefore \R\ < diam(i?) d < n 2d 5 d . Using again (1.30), and recalling that - = ^ + i we 
find 

1 2f/ — 

11/ - I*/||rp(B) < Cl^i diam(i?) m ||rf m /|| iOO( « , < C'n^bl 
where C" = C\\d m f\\ ^oo(fl ) . From the previous observations it follows that 

\\f-Iv n f\\ LP{ R») < #(V n )imaf\\f-I R f\\ LP{R) 



Hence, 



< #(V n )pn-¥ max{l + C'u^rC 1 diam(i? )), C"S^}. 
limsup#(P n )"pra^||/-Ip n /|| LP(Ro) < 1. 



Combining the last equation with (1.36), we obtain 



limsup#(P n )f||/-I Pn /|| 



i p (i?o) 



< 



R(x) 1 dx 



Ro 



The sequence of block partitions TZn := P n (A0 clearly satisfies #(7Zn)/N — > 1 as iV — >• oo 



and therefore leads to the announced equation (1.40). Furthermore, it follows from the 



boundedness of diam(i?(x)) on Rq and the properties of V n described in Lemma 1.3.5 that 



SU P I H z ('Pn) d sup diam(_R) I < oo 
n>i V ReV n J 

which implies that TZn is an admissible sequence of partitions. 



We now choose adeq uate l ocal 
announced in Theorems 1.1. 5| and 
modified shape function 

K M M :-- 



jlock specifications in order to obtain the estimates 

= y/d we define the 



1.1.7 



For any M > diam(F 



inf 

|fl|=i, 

diam(i?)<A/ 



7T - 1rTt\\lp(R), 



;i.4i) 



where the infimum is taken on blocks of unit volume and diameter smaller that M. It 
follows from a compactness argument that this infimum is attained and that Km is a 
continuous function on H m . Furthermore, for any fixed 7r e H m , M h-> Km{k) is a 
decreasing function of M which tends to Ki(jr) as M — > oo. 



1.3. Proof of the approximation results 
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For all x G Rq we denote by R* M (x) a block which realises the infimum in Km{^x)- 
Hence, 



\ r *m( x )\ = 1) diam(R* M (x)) < M, and K m (tTx) = \Ux ~ Ir* 
We define a local block specification on i? as follows 

R M (x) := (K M (n x ) + M- x )-^R* M {x 



R* M (x) K x \\lp(R* m (x)) 



1.42) 



We now observe that using a change of variables and the homogeneity of ir x , as in (1.11), 
that 



Ir M (x) Kx\\lp(R m (x)) = Km(tt x )(Km(^x) + M 

>M\ 

-N )N>1 



< 1. 



Hence, according to Lemma 1.3.6, there exists a sequence (TZ^)n>i of block partitions of 
Ro such that 



limsupiV™ ||/ - I u m f\\ L p(R ) < \\KmM + M 1 \\l^(r )- 

jV->oo N 

Using our previous observations on the function Km, we see that 

lim \\K M (ir x ) + M~ 1 || x . (Bo ) = WKj^Wlt^. 

Hence, given e > we can choose M(e) large enough in such a way that 
\\K M{£) (n x ) + M(ey 1 \\ L r {Ro) < \\K I (7r x )\\ L r^ Ro) + e, 



which concludes the proof of the estimate (1.19) of Theorem 1.1.5 
For each N let M(N) be such that 

N^\\f-\ M(N) f\\ LP{Ro) < llKMwM + MiN^Urw + MiN)- 1 

/V JV 

and M(N) — > oo as N — > oo. Then the (perhaps non admissible) sequence of blo ck par- 
titions TZn '■= 1Z K ^ N ^ satisfies ( |l.l8 ) which concludes the proof of Theorem 



1.1.5 



o 



We now turn to the proof of Theorem |1.1.7[ which follows the same scheme for the 
most. There exists d functions Xi(x), ■ ■ ■ , Xd(x) G C°(Ro), and a function x tt*(x) G 
such that for all x E Ro we have 

Kx = \t(x)X? + TT*(x). 



Ki<d 



The hypotheses of Theorem 



1.1.7 



state that Ki (^^r 1 ) = Kfa) ^ for all x G E . It 
follows from Propositions 1.2.7| and 1.2.8 that the product Xx(x) ■ ■ ■ Xd(x) is nonzero for 
all x G Ro- We denote by Si G {±1} the sign of Aj, which is therefore constant over the 
block R , and we define 

Ki<d 
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The proofs of Propositions |1.2.8| and 1.2.7 show that there exists a block R £ , satisfying 



Rr 



1, and such that K I {n £ 



\TT 



matrix of entries |Ai(x) 



Ir £ tt\\lp(r £ )- By D(x) we denote the diagonal 



Clearly det > 



1 and TT r o 



|Arf(x)|, and we define 
4> x := (det D(x))^D(xy™ 
c = (det D(x))^7T £ + tt*(x) o 



and ir*(x) o <p x e P£. Hence 



using (1.5) we obtain 

W^x ~ I<t> x (R e ) K x \\ L P{<t> x {R e )) 



\\ir x o cf) x - 1 Rs (tt x o (j) x ) \\ L p( Re ) 

(detD(x))3||7r e - l Re 7i £ \\ LP{Re) 

{det D(x))^Kj(n e ) 
Ki(ir x ). 



We then define the local block specification 

R(x) := Kj(ti x ) 



x{Re 



1.43) 



in such way that — || ^(^(a;)) = 1 for all x E Rq, using the homogeneity of n x 

and an isotropic change of variables. The admissible sequence {TZn)n>i of block parti- 
tions constructed in Lemma 1.3.6 then satisfies the optimal upper estimate (1.18), which 
concludes the proof of Theorem 1.1. 7[ o 



Remark 1.3.7 (Adaptation to weighted norms). Lemma 1.3.6 also holds if (1.39) is 
replaced with 

Q(x)\\n x -Ir( x )(k x )\\lp(r(x)) < 1 



and if the L P (R ) norm is replaced with the weighted L P (R , f2) norm in (1.40). Replacing 
the block Rm(x) defined in (1.42) with 

R' M {x) := Sl(x)-iR M (x), 



one can easily obtain the extension of Theorem 1.1.5 to weighted norms. Similarly, re- 
placing R{x) defined in (1.43) with R'{x) := Vt(x)~ dR(x), one obtains the extension of 
Theorem 1.1. 7 to weighted norms. 



1.4 Appendix : Proof of Lemma 1.3.5 



By Q n we denote the standard partition of Ro G 1R in n identical blocks of diameter 
n -1 diam(i? ) illustrated on the left in Figure [TTTj For each Q e Q n by xq we denote the 
barycenter of Q and we consider the tiling 7q of lR d formed with the block u~ 2 R(xq) and 
its translates. We define V^(Q) and V\ as follows 



V\{Q) :={ReT Q ; RCQ} and V\ 



U V n(Q)' 

QeQ„ 



1.4. Appendix : Proof of Lemma 



1.3.5 
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~-fm 

I I I lllllllllllllllllllllllllll 1 1 1 

Figure 1.1 - (Left) the initial uniform (coarse) tiling Q 3 of Rq. (Right) the set of blocks 
V\ in green and the set of blocks V%* in red. 



Comparing the areas, we obtain 



QeQn QeQn 



\Q\ 



\n 



-2 



R{xq)\ 



n 



2d 



QeQ„ 



From this point, using the continuity of i 4 |i?(x)|, one easily shows that 



J Rq \R(x)\ 1 dx as n — > oo. Furthermore, the property (1.37) clearly holds. In order to 
construct V%, we first define two sets of blocks V%*(Q) and V%* as follows 

Vl*(Q) :={RnQ ; ReT Q and int(i2) n dQ ^ 0} and := \J P?(Q). 

QeQ n 

Comparing the surface of dQ with the dimensions of R(xq), we find that 

where C is independent of n and of Q G Q n . Therefore, < Cn 2d ~ l . The set 

of blocks V\ is then obtained by subdividing each block of V%* into o(n) (for instance, 
|_ln(n)J d ) identical sub-blocks, in such a way that #("P 2 ) is o(n M ) and that the requirement 
(fL~38j) is met. 
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2.1 Introduction. 

2.1.1 Optimal mesh adaptation 

In finite element approximation, a usual distinction is between uniform and adaptive 
methods. In the latter, the elements defining the mesh may vary strongly in size and 
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shape for a better adaptation to the local features of the approximated function /. This 
naturally raises the objective of characterizing and constructing an optimal mesh for a 
given function /. 

Note that depending on the context, the function / may be fully known to us : either 
through an explicit formula or a discrete sampling ; or observed through noisy measure- 
ments ; or implicitly defined as the solution of a given partial differential equation. 

In this chapter, we assume that / is a function defined on a polygonal bounded domain 
(let 2 . For a given conforming triangulation T of Q and an arbitrary but fixed integer 
m > 1, we denote by I™ -1 the standard interpolation operator on the space of Lagrange 
finite elements of degree m—1 associated to T. Given a norm X of interest and a number 
N > 0, the objective of finding the optimal mesh for / can be formulated as solving the 
optimization problem 

# mng|/-i r i/|U, 

where the minimum is taken over all conforming triangulations of cardinality N. We 
denote by Tiv the minimizer of the above problem. 

Our first objective is to establish sharp asymptotic error estimates that precisely des- 
cribe the behavior of ||/ — I™ -1 f\\x as N — > +oo. Estimates of that type were obtained 
in [4j|27] in the particular case of linear finite elements (m — 1 = 1) and with the error 
measured in X = LP . They have the form 



limsupfiV min ||/ - I^/||lp) < C y/\tet{#f)\ , - := - + 1, (2.1 

N^+oo V #{T)<N J L r T J) 



I 1 

P 



which reveals that the convergence rate is governed by the quantity y/\ det(d 2 f)\, which 
depends nonlinearly the on Hessian d 2 f. This is heavily tied to the fact that we allow 
triangles with possibly highly anisotropic shape. In the present work, the polynomial 
degree m — 1 is arbitrary, and the quantities governing the convergence rate will therefore 
depend nonlinearly on the m-th order derivative d m f. 

Our second objective is to propose simple and practical ways of designing meshes 
which behave similarly to the optimal one, in the sense that they satisfy the sharp error 
estimate up to a fixed multiplicative constant. 



2.1.2 Main results and layout 

We denote by H m the space of homogeneous polynomials of degree m : 

H m := Span{a;V ; k + I = m}. 

For any triangle T, we denote by I™ -1 the local interpolation operator acting from C°(T) 
onto the space P m _i of polynomials of total degree m — 1. The image of v G C°(T) by 
this operator is defined by the conditions 

15T 1 «(7) = «(7) 

for all points 7 G T with barycentric coordinates in the set {0, — — ^j, • • • ,1}. We 
denote by 

e m ,T(»p : = \\ v - ^t~ 1v \\lp{t) 
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the interpolation error measured in the norm L P (T). We also denote by 



e m ,T{v) p 



\v - r- 1 



r v \\L p 



E 

\TeT 



e m ,T(v 



the global interpolation error for a given triangulation T, with the standard modification 
if p = oo. 

A key ingredient in this chapter is a function defined by a shape optimization problem : 
for any fixed 1 < p < oo and for any ix G H m , we define 



inf e m . T (7r) p . 

|T|=1 



(2.2) 



Here, the infimum is taken over all triangles of area \T\ = 1. Note that from the homoge- 
neity of it, we find that 

(2.3) 



inf e m T (ir) p = K m ^(ix)A™ + l . 

\T\=A 



This optimization problem thus gives the shape of the triangles of a given area which 
is best adapted to the polynomial n in the sense of minimizing the interpolation error 



measured in LP. We refer to K m ^ p as the shape function. We discuss in £2.2 the main 
properties of this function. 



Our asymptotic error estimate for the optimal triangulation is given by the following 
theorem : 

Theorem 2.1.1. For any bounded polygonal domain Q C 1R 2 and any function f 6 
C m (fl), there exists a sequence of triangulations (1~n)n>n , conforming if p < oo, with 
#(T N ) < N, such that 



limsupiV 2 e mj r N (f) P < 



K 



rn.p 



d m f 



L T (f2) 



1 



T 



m 1 
~2 + p' 



(2.4) 



An important feature of this estimate is the "limsup" asymptotical operator. Recall 
that the upper limit of a sequence (un)n>n is defined by 

limsup Mat := lim sup u n 

N^oo N^oo n>N 



and is in general strictly smaller than the supremum sup Ar>A r o u^. It is still an open 
question to find an appropriate upper estimate for sup Ar>A r o N ^ e m j- N {f) p when optimally 
adapted anisotropic triangulations are used. 



In the estimate (2.4), the m-th derivative d m f(z) is identified to a homogeneous poly- 
nomial in H m : 

d m f(z) 



ml 



Ed m f x k y l 
d k xd l y^I\l\' 

k+l=m a 



(2.5) 
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In order to illustrate the sharpness of (2.4), we introduce a slight restriction on sequences 

a sequence (Tn)n>n °f triangulations, such 



of triangulations, following an idea in [4] 
that #(7jv) < ^V") is said to be admissible if 



In other words 



sup I N 2 sup diam(T) I < oo. 

N>Nq V Ter N J 



sup diam(T) < C A N- 1/2 

TgTn 



(2.6) 



for some Ca > independent of N. The following theorem shows that the estimate (2.4) 



cannot be improved when we restrict our attention to admissible sequences. It also shows 



that this class is reasonably large in the sense that (2.4) is ensured to hold up to small 
perturbation. 

Theorem 2.1.2. Let Q C M 2 be a bounded polygonal domain, and f G C m (Q). Set 
- := y + K For all admissible sequences of triangulations (Tn)n>n , conforming or not, 
one has 

— / ( J 

liminf N * e m ,T N {f) p > ' 

AT— »oo 



K„ 



m,p 



ml 



For all e > 0, there exists an admissible sequence of triangulations (J~n)n>n > 
if p < oo, such that 

'oTf 



conforming 



lim sup N 2 e m ,T§ (f) P < 



K 



rn.p 



ml 



+ £. 



L T (VL) 



Note that the sequences (T^)n>n satisfy the admissibility condition (2.6) with a 
constant Ca{s) which may explode as e — > 0. The proofs of both theorems are given 
in §2.3[ These proofs reveal that the construction of the optimal triangulation obeys 
two principles : (i) the triangulation should equidistribute the local approximation error 
e m,T(f) P between each triangle, and (ii) the aspect ratio of a triangle T should be isotropic 
with respect to a distorted metric induced by the local value of d m f on T (and there- 
fore anisotropic in the sense of the Euclidean metric). Roughly speaking, the quantity 
||-^m,p ("^i ) \\l t (t) controls the local interpolation L p -error estimate on a triangle T once 
this triangle is optimized with respect to the local properties of /. This type of estimate 
differs from those obtained in \2\, which hold for any T, optimized or not, and involve the 
partial derivatives of / in a local coordinate system which is adapted to the shape of T. 



The proof of the upper estimates in Theorem |2.1.2| involves the construction of an 
optimal mesh based on a patching strategy similar to j4). However, inspection of the proof 
reveals that this construction becomes effective only when the number of triangles N 
becomes very large. Therefore it may not be useful in practical applications. 

A more practical approach consists in deriving the above mentioned distorted metric 
from the exact or approximate data of d m f using the following procedure. To any 7r G H to , 
we associate a symmetric positive definite matrix h n G S^- If z G f2, and d m f(z) is close 
to 7r, then the triangle T containing z should be isotropic in the metric h n . The global 
metric is given at each point z by 



h(z) = s(n z )h nz 



71% 



d m f(z) 
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where s(tt z ) is a scalar factor which depends on the desired accuracy of the finite element 
approximation. Once this metric has been properly identified, fast algorithms such as 
in 19,92,94 can be used to design a near-optimal mesh based on it. Recently in 15,66 



several algorithms have been rigorously proven to terminate and produce good quality 
meshes, see also Chapter 5. Computing the map 



(2.7) 



is therefore of key use in applications. This problem is well understood in the case of linear 
elements (m = 2) : the matrix h n is then defined as the absolute value (in the sense of 
symmetric matrices) of the matrix associated to the quadratic form tt. In contrast, the 
exact form of this map in the case m > 3 is not well understood. 

In this chapter, we propose algebraic strategies for computing the map (2.7) for m = 3 



which corresponds to quadratic elements. These strategies have been implemented in 
an open-source Mathematica code 95 . In a similar manner, we address the algebraic 



computation of the shape function K m ^(ir) from the coefficients of it £ H m , when m > 3. 



All these questions are addressed in §2.4 2.5 and 2J3 

In £2.4, we discuss the particular case of linear (m = 2) and quadratic (m = 3) 
elements. In this case, it is possible to obtain explicit formulas for K mp (ir) from the 
coefficients of tt. In the case m = 2, this formula is of the form 



K 



■2,P 



ax' 



+ 2bxy + cy 2 ) = a^/\b 2 - 



ac\ 



where the constant a only depends on p and the sign of b 2 —ac, and we therefore recover the 
known estimate (2.1 ) from Theorem 2.1.1 The formula for m = 3 involves the discriminant 



of the third degree polynomial d 3 f. Our analysis also leads to an algebraic computation 



of the map (2.7). We want to mention that a different strategy for the construction of the 



distorted metric and the derivation of the error estimate for a finite element of arbitrary 
order was proposed in 23 . In this approach, the distorted metric is obtained at a point 



z E by finding the largest ellipse contained in a level set of the polynomial associated to 



d m f(z) by (2.5). This optimization problem has connections with the one that defines the 



shape function in (2.2), as we shall explain in £2.2 The approach proposed in the present 



work in the case m = 3 has the advantage of avoiding the use of numerical optimization, 
the metric being directly derived from the coefficients of d m f. 

In £2.5 we address the case m > 3. In this case, explicit formulas for K m ^ p {n) seem out 
of reach. However, we can introduce explicit functions K m (7r) which are polynomials in 



the coefficients of tt, and are equivalent to K mjP (Tr), leading therefore to similar asymptotic 
error estimates up to multiplicative constants. At the current stage, we did not obtain a 
simple solution to the algebraic computation of the map (2.7) in the case m > 3. The 
derivation of K m is based on the theory of invariant polynomials due to Hilbert. Let us 



mention that this theory was also recently applied in 79 to image processing tasks such 
as affine invariant edge detection and denoising. 



We finally discuss in £2^3 the possible extension of our analysis to simplicial elements 
in higher dimension. This extension is not straightforward except in the case of linear 
elements m = 2. 
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2.2 The shape function 

In this section, we establish several properties of the function K mp which will be of 
key use subsequently. We assume that to > 2 is an integer, and p G [1, oo]. We equip the 
finite dimensional vector space H m with a norm || ■ || defined as follows 

||7r|| := sup \ir(x,y)\. (2.8) 

x 2 +y 2 <l 

Our first result shows that the function K mp vanishes on a set of polynomials which has 
a simple algebraic characterization. 

Proposition 2.2.1. We denote by s m := + 1 the smallest integer strictly larger than 
m/2. The vanishing set of K m p is the set of polynomials which have a generalized root of 
multiplicity at least s m : 

K m , P (T!') = ^ ir(x,y) = (ax + f3y) Sm n, for some a, (3 G K and n G H m - Sm . 

Proof: We denote by T cq a fixed equilateral triangle of unit area, centered at 0. 

We first assume that n(x : y) = (ax + (5y) Sm Ti. Then there exists a rotation R G 2 
and 7T G H m _ Sm such that 

( m-Sr n, 
a i x i y m ~ Sm ~' 1 
i=0 

Therefore, denoting by (fi £ the linear transform (f> £ (x,y) = R (ex, |), we obtain 



m-s„. 

i„.m—s m —i 



7io<f) £ = (ex) Sm ai(ex) l (y/e) m - Sm ~ l = £ 2s ™- m x Sm 

V i=0 / \ i=0 

hence n o e — y as e — > 0. Since |det0 £ | = 1, the triangles e (T eq ) have unit area. 
Consequently, 

e m ,0 e (T oq )(7r)p = e m , Tcq (vr o <f) £ ) p ^ as e ->■ 0, 

and therefore K mtP (ir) = 0. 

Conversely, let n G H m \ {0} be such that K m ^ p (-n) = 0. Then there exists a sequence 
(T n ) n >o of triangles with unit area such that e mt T n (^) p — > 0. We remark that the interpo- 
lation error e T (n) p of n G H m is invariant by a translation r h : z ^ z + h of the triangle 
T. Indeed, 7T — n o G P m -i, so that 

||7T - I™" 1 7T|| iP ( Th ( T )) = \\tT OT h - I™ _1 (7T O r/i)||iP(T) = 1 1 VT - I™ _1 7T || LP ( T ) . (2.9) 

Hence we may assume that the barycenter of T n is and write T n = 0„(T cq ) for some 
linear transform <p n with det n = 1. Since e m ^(-) P is a norm on H m , it follows that 

7T O 0„ ->■ 0. 

The linear transform <p n has a singular value decomposition 
(pn = U n o D n o V n , where U n , V n G £> 2 , and D n = ( £ ™ ^ V < e n < 1. 
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Note that \\ir o V\\ = \\ir\\ for any ir G H m and V G O^- Therefore, 

\\ir o U n o D n \\ = \\n o U n o D„ o V n o V~ || = || 7r o (p n \\ — >■ 0. 

Denoting by a l)Tl the coefficient of x % y m ~ % in 7r o U n , we find that ai jn e^~ m tends to as 
n — > +oo. In the case where i < s m , this implies that a^ n tends to as n — » +oo. 

By compactness of O2 we may assume, up to a subsequence extraction, that U n 
converges to some U G Oi. Denoting by the coefficient of x % y m ~ l in n o U, we thus 
find that ai = if % < s m . This implies that it o U(x, y) = x Sm n(x, y) which concludes the 
proof. o 



Remark 2.2.2. In the simple case m = 2, we infer from Proposition 2.2.1 that K2 )P (ti) = 
if and only if it is of the form Tr(x,y) = x 2 up to a rotation, and therefore a one- 
dimensional function. For such a function, the optimal triangle T degenerates to a segment 
in the y direction, i.e., optimal triangles of a fixed area tend to be infinitely long in one 
direction. This situation also holds when m > 2. Indeed, we see in the second part in the 
proof of Proposition 2.2.1 that if -n is a nontrivial polynomial such that K m ^ p {ii) = 0, then 
e n must tend to as n — )■ +00. This shows that T n = <j> n (T) tends to be infinitely flat in 
the direction Ue y with e y 



(0,1). However, K mp (7r) = does not any longer mean that 



ii is a polynomial of one variable. 

Our next result shows that the function K mp is homogeneous and obeys an invariance 
property with respect to linear change of variables. 

Proposition 2.2.3. For all n G B m , A G R, and <p e C(R 2 ), 



K mjP (\7r) 

K m)P (TI O (f)) 



|A|i^ m , p (7r), 
|det</>r /2 ^ m , p (7r) 



(2.10) 
(2.11) 



Proof: The homogeneity property (2.10) is a direct consequence of the definitions of K m>p . 
In order to prove the invariance property (2.11 ), we assume in a first part that det 7^ 0, 



and we define T :- 



a/I det 0| 



and n(z) := vr( y/\ det J\z) = | det 0| m / 2 vr(; 



We now remark that the local interpolant I™ 1 commutes with linear change of va- 



riables in the sense that, when is an invertible linear transform, 



I 



m— 1 , 



, T y v o <p) = (I-"} V) O (2.12) 
for all continuous functions v and triangles T. Using this commutation formula we obtain 



det0r 1/?, e 



7T 



det^ 2 e m An) p . 



Since the map T 1— > T is a bijection of the set of triangles onto itself, leaving the 

is invertible. When det <t> 



area invariant, we obtain the relation (2.11) when 



0, the 
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polynomial 7ro0 can be written (ax + f3y) m so that K m ^ p (TT o 0) = by Proposition 2.2.1 o 



The functions K m>p are not necessarily continuous, but the following properties will 
be sufficient for our purposes. 

Proposition 2.2.4. The function K m ^ is upper semi- continuous in general and conti- 
nuous if m = 2 or m is odd. Moreover, the following property holds : 

If n n tx and Kra^ijin) — > 0, then K m ^(ir) = 0. (2-13) 

Proof: The upper semi- continuity property comes from the fact that the infimum of 
a family of upper semi-continuous functions is an upper semi-continuous function. We 
apply this fact to the functions tt i— > e m ^(^ s ) P indexed by triangles which are obviously 
continuous. 

For any p olyn omial tt G H2, tt = ax 2 + 2bxy + cy 2 , we define det tt = ac — b 2 . It will 



be shown in ? ^41 that K 2 , p (tt) = o p sJ | det 7r|, where a p only depends on the sign of det7r. 



This clearly implies the continuity of K 2 . p . We next turn to the proof of the continuity of 
K mtP for odd m. Consider a polynomial tt e H m . If i^ mp (7r) = 0, then the upper semi- 
continuity of K m ^p, combined with its nonnegativity, implies that it is continuous at tt. 
Otherwise, assume that K mjP (ir) > 0. Consider a sequence TT n e H m converging to tt and 
a sequence <f) n of linear transformations satisfying det <f) n — 1, and such that 

lim e^iT ^n) = liminf K m>p (-K*) := lim inf K mp (TT*). 

n— >+oo tt*— >tt r— >0 \\n*—n\\<r 

If the sequence (fi n admits a converging subsequence 4> nk — > (ft, it follows that 
K m ^Tr) < e 0(rcq) (vr) = lim e (Tcq) (7r„J = liminf K m , p {ir*). 

fe— > + 00 K TT*^r1X 

This asserts that K m p is lower semi-continuous at tt, and therefore continuous at tt since 
we already know that K m ^ p is upper semi-continuous. 

If (p n does not admit any converging subsequence, then we invoke the Singular Value 
Decomposition (SVD) n = U n oD n oV n , where U n , V n G O2 and D n = diag(e ri , — ), where 
< s n < 1. (Here and below, we use the shorthand diag(a, b) to denote the diagonal 
matrix with entries a and b.) The compactness of O2 implies that U n admits a converging 
subsequence U njt — > U . In particular, TT nk o U nk converges to tt o U . Therefore, denoting by 
a i)Tl the coefficient of x l y m ~ % in TT n oU n , the subsequence a i)nk converges to the coefficient 
<2j of x l y m ~ l in tt o [/. Observe also that e„ — )■ ; otherwise, some converging subsequence 
could be extracted from <p n . Since e0 n (r cq )(vr n ) = eT cq (vr n o0 n ), the sequence of polynomials 
TTn 0n is uniformly bounded and so is the sequence TT n oU n o D n . Therefore, the sequences 
(aj jri £^~ m ') n >o are uniformly bounded. It follows that = when i < y. Since m is odd, 
this implies that tto[J(x, y) = x Sm Tf(x, y), and Proposition 2.2.1 implies that K m ^ p (Tr) = 0, 
which contradicts the hypothesis K mtP {rt) > 0. 



Finally, we prove property (2.13). The assumption K m ^ p (TT n ) — > is equivalent to the 



existence of a sequence T n = n (T eq ) with det n = 1 such that em^T n {^n) P — > 0. Reasoning 



in a similar way as in the proof of Proposition 2.2.1 , we first obtain that tt u o n — y 0, and 



we then invoke the SVD decomposition of (fi n to build a converging sequence of orthogonal 



2.2. The shape function 



79 



matrices U n — > U and a sequence < e n < 1 such that if aj in is the coefficient of x l y m ~ l 
in 7r n o U n , we have a^ n e%~ m — > 0. When i < s m , it follows that a i)n — > 0, and therefore 
7r o U(x, y) = x Sm 7r(x,y). The result follows from Proposition 2.2.1 o 



in 



We finally make a connection between the shape function and the approach developed 

(2.14) 



23 . For all n G H m , we denote by the level set of \n\ for the value 1 

{(x,y)ER 2 , \ir{x,y)\<l}. 



A, 



We now define 



sup 



\E\liz 



-m/2 



(2.15) 



where the supremum is taken over the set £ of all ellipses centered at 0. We use a bold 
font to denote the numerical constant 7T = 3.14159... The optimization among ellipses 
defining can be rephrased as an optimization on the cone of 2 x 2 positive symmetric 
matrices. 



K^(tt) = inf{(detM)T ; M G 5+ and Vz G M 2 , (Mz,z) > |7r(z)| 2 / m }. 



(2.16) 



The minimizing ellipse E* is then given by {(Mz,z) < 1}. The optimization problem 
described in (2.16) is quadratic in dimension 2 and subject to (infinitely many) linear 
constraints. This apparent simplicity is counterbalanced by the fact that it is noncon- 
vex. In particular, it doe s not have unique solutions and may also have no solution. We 
construct in Chapter 6, ^ 6.5.3 a variant of K £ which is defined by by a well posed 
optimization problem, for which the optimal matrix is unique and continuously depends 
on the parameter ix. 

Proposition 2.2.5. On M m , one has the equivalence 

cK £ m < K m , p < CKl 
with constant < c < C independent of p. 

Proof: We consider a fixed triangle T* of unit area, for instance an equilateral triangle, 
for each exponent 1 < p < oo on H m we define a norm || • 

IMIp := II 71 " - ItT 1 n \\LP(T«), 

in such way that 



| p on H m as follows 



inf II 7r o 

|det</>|=l 



\P' 



(2.17) 



where the infimum is taken among the collection of linear changes of variables satisfying 

Idet^l 



1. Using (|2.16|) and the homogeneity of n we obtain a similar expression for the 

inf \Wo<j>\\. (2.18) 

|det</>|=l 
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Indeed if G GL d satisfies |det0| = l,thenM := ||vro0|| m (0" 1 ) T 0^ 1 satisfies (detM) t = 
||7ro0|| and (Mz, z) > \it(z)\m. Conversely to any M 6 S 2 + we associate = M~5 (det M)j 
which satisfies det 0=1. 

Since the vector space H m has finite dimension there exists C > c > such that for 
any tt G H m and any exponent 1 < p < oo one has 



ell 71 "!! < IMIl ^ 1 1 7T 1 1 ,„ < II 7T II oo < C||7r||. 



Combining this with (2.17) and (2.18) we obtain the announced result. 



Remark 2.2.6. Since K mp and are equivalent, they vanish on the same set, and 



therefore Proposition 2.2.1 is also valid for K^. It also easy to see that satisfies the 



homogeneity and invariance properties stated for K m ^ in (2.10) and (2.11), as well as the 
continuity properties stated in Proposition 2.2.J\ 



Remark 2.2.7. The continuity of the functions K m „ and K!L can be established when m is 



odd or equal to 2, as shown by Proposition \2.2.J\ but seems to fail otherwise. In particular, 
direct computation shows that Kf(x 2 y 2 —ey 4 ) is independent of e > and strictly smaller 
than Kf(x 2 y 2 ). Therefore, Kf is upper semi- continuous but discontinuous at the point 



x 2 y 2 G B A . 



2.3 Optimal estimates 

This section is devoted to the proofs of our main theorems, starting with the lower 



estimate of Theorem |2.1.2[ and continuing with the upper estimates involved in both 
Theorem |2"XD and I2JL21 

Throughout this section, for the sake of notational simplicity, we fix the parameters 
m and p and use the shorthand 

K = K m , p and e T (vr) = e rnjT (n) p . 

For each point z G Q we define 

vr z := r- G lti m , 

ml 

where / G C m (Q) is the function in the statement of the theorems. We denote by 

u(r) := sup ||7r 2 — n z > || 

\\z—z'\\<r 



the modulus of continuity of z h- > tt z with the norm || ■ || defined by (2.8). Note that 
u)(r) — > as r —¥ 0. 
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2.3.1 Lower estimate 

In this proof we will use an estimate from below of the local interpolation error. 

Proposition 2.3.1. Assume that 1 < p < oo. There exists a constant C > 0, depending 
on f and Q, such that for all triangles TcO and z G T , 

e T (f) p > K p {tx z )\T\^ +1 - C(diamT) mp |T|w(diamT). (2.19) 

Proof: Denoting by fi z G P m the Taylor development of / at the point z up to degree 
m, we obtain 

f(z + u) - ii z {z + u) =m f (n z+tu (u) - ir z (u))(l - t) m ^dt, 

Jt=o 

and therefore 

11/ - Hz\\l°°(t) < C diam(T) m u;(diam(T)), 

where Co is a fixed constant. By construction 7r 2 is the homogenous part of \i z of degree 
m, and therefore \x z — ti z G P m -i- It follows that for any triangle T, we have 

Hz-V?- 1 liz=nz-V?- 1 irz. (2.20) 

We therefore obtain 

|er(/)-e T (7r z )| < ||(/ - f) - {it z - I™ -1 -k z )\\lp{t) 

< \T\^\\(f - I-- 1 /) - (ii z - I™- 1 fl z )\\ L ^ T) 

= mV'wii-Trw-riWL-m 

< C 1 |T| 1 ^||/- / x z || ioo(T) 

< C Ci|T| 1/p diam(T) m w(diam(T)), 

where C\ is the norm of the operator / — I™ -1 : C°(T) — » C°(T) in L°°(T) norm which is 
independe nt o f T. 

From (2.3) we know that er^z) > \T\^ + pK(jr z ), and therefore 

e T (f) > K(tx z )\T\^ + v -C C 1 |T| 1 / p diam(T) m w(diam(T)). 

We now remark that for all p G [1, oo) the function r i-> r p is convex, and therefore if a, 6, c 
are positive numbers, and a > b — c, then a p > max{0, b — c} p > IP — pcb" 9 ^ 1 . Applying 
this to our last inequality we obtain 



e T {f) p > K p {7r z )\T\^ +1 - pC Ci(K(7T z )) p - 1 \T\ ( - p - 1) ^ + p )+ i diam(T) m u;(diamT). 
Since \T\ (p ~ 1){ ^ + r )+ p = {T^-^f \T\ < (di&mT) m ^\T\, this leads to 
e T {f) p > K p {tt z )\T\^ l+1 -C(diamT) mp |T|a;(diamT), 



where C := pC Ci(sup zen K(n z )) 



p-i 



o 
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We now turn to the proof of the lower estimate in Theorem 2.1.2 in the case where 
p < oo. Consider a sequence {Tn)n>n °f triangulations which is admissible in the sense 



of equation (2.6). Therefore, there exists a constant Ca such that 

diamT < C A N~ 1/2 , N > N , T G Tn- 



For T G Tn, we combine this estimate with ( |2.19 ), which gives 

e T (f) p > K p (tt z )\T\^ +1 - {CAN-V^^CuiCAN- 1 ' 2 ). 
Averaging over T, we obtain 

e T (f) p > [ K^Tt-^T^dz-^N-^C^CujtCAN- 1 ' 2 ). 
Jt 

Summing on all T G Tn, and denoting by the triangle in Tn containing the point 
z G Q, we obtain the estimate 



e TN (f) P > I K(7r z )\T z N \^dz-N-^e(N), 
Jn 



(2.21) 



where e(N) := \n\CYCuj{C A N^I 2 ) as N -»■ +oo. The function z h-> |Tf | is linked 
with the number of triangles in the following way : 

dz 



\T N \ 



On the other hand, with 



■ 2 + we have by Holder's inequality, 



1 m i 1 



r/p 



I J>JV 



1-r/p 



(2.22) 



Combining the above, we obtain a lower bound for the integral term in (2.21) which is 
independent of Tn ■ 

J K p (n z )\T?\^dz> (j K T (7T z )dzy T N~ mp/2 . 



p/r 

K T (ix z )dz\ -e(N) 



Inserting this lower bound into (|2.21|) we obtain 

erMY> 
This allows us to conclude 

hminfiVfe rjv (/) > ( / K T (rr z )dz 

TV— >+oo 



(2.23) 
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which is the desired estimate. 



The case p = oo follows the same ideas. Adapting Proposition 2.3.1, one proves that 

e T (f) > K(ir z )\T\? - C(diamT) m w(diamT), 

and therefore 

e TN (f) > \\K(7r z )\T z N \^\\ Loc{n) -N-fe(N), (2.24) 
where e(N) := C™Cuj(CaN~z) — > as N — > +oo. The Holder inequality now reads : 

1 



K(-K z ) m dz < K(ir z ) m \T. 



L°°(Q) 



\T Z N \ 



equivalently, 



\KM\T t 



A' I 



lL°°(f2) 



> / K(n z )™dz 



n 



Combining this with (2.24), we obtain to the desired estimate (2.23) with p 

_2_ 
m ' 



oo and 



T 



Remark 2.3.2. This proof reveals the two principles which characterize the optimal trian- 



gulations. Indeed, the lower estimate (2.23) becomes an equality only when both inequalities 



in (2.19) and (2.22) are equality. The first condition - equality in (2.19) - is met when each 
triangle T has an optimal shape, in the se nse that eri^z) = K(k z )\T\^ + p for some z E T. 
The second condition - equality in (2.22) - is met when the ratio between K p (7i z )\T^\^~ 
and jT^I -1 is constant, or equivalently K(% z )\T\^~ + p is independent of the triangle T. 
Combined with the first condition, this means that the error er(/) p is equidistributed over 
the triangles, up to the perturbation by (diamT) mp |T|o;(diamT) which becomes negligible 
as N grows. 



2.3.2 Upper estimate 



We first remark that the upper estimate in Theorem 2.1.2 implies the upper estimate in 
Theorem 2.1.1 by a sub-sequence extraction argument : if the upper estimate in Theorem 
2.1.2 holds, then for all e > there exists a sequence (T^)n>n , #(7at) < N, such that 

d m f 



\wa.suv[N™ e T e(f)) < 



K 



ml 



with i = - + y. We then choose a sequence (£n)n>n such that 



Nfe^ N (f)< 



K 



d m f 



ml 



+ 2e 



N 



for all N > N , and En — ^ as iV - >■ oo. Defining 7jv : = T^ N we thus obtain 

'd m f 



limsup(iV™e TN (/)) < 



K 



ml 
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which concludes the proof of Theorem |2.1.1 We are thus left with proving the upper 



estimate in Theorem 2.1.2 We begin by fixing a (large) number M > 0. We shall take 



the limit M — > oo in the very last step of our proof. We define 

T M = {T triangle ; |T| = 1, bary(T) = and diam(T) < M}, 

the set of triangles centered at the origin, of unit area and diameter smaller than M. 
This set is compact with respect to the Hausdorff distance. This allows us to define a 
"tempered" version of K = K m p that we denote by K M : for all 7r G H m 

K m (tt) := inf e T (ir). 

Since Tm is compact, the above infimum is attained on at least a triangle, that we denote 
by Tm (jr). Note that the map 7r i— y Tm(vt) need not be continuous. It is clear that Km{^) 
decreases as M grows. Note also that the restriction to triangles T centered at is artificial, 



since the error is invariant by translation as noticed in (2.9). Therefore, Km(tt) converges 
to K(tc) as M — y +oo. Since Tm is compact, the map n \-> maxy 6 T M e r{^) defines a norm 
on H m , and is therefore bounded by CmIMI for some Cm > 0. One easily sees that the 
functions n \-t e T (n) are uniformly CM-Lipschitz for all T G T M , and so is K M . 

We now use this new function Km to obtain a local upper error estimate that is closely 



related to the local lower estimate in Proposition 2.3.1 



Proposition 2.3.3. For z\ G Q, let T be a triangle which is obtained from Tm(7T Zi ) by 
rescaling and translation ( T — tT/v/(vr 2l ) + zq). Then for any Z2 G T , 

e T (f) < (K M (ir Z2 ) +B M u{ma X {\z 1 - z 2 \, diam(T)})) \T\^ + K (2.25) 

where B M > is a constant which depends on M. 
Proof: For all Zi,z% G £1, we have 

e T M ^, 1 )i. 7[ z 2 ) < e TM{7Tzi) (7i Zl ) + C M hz 1 -^z 2 \\ 

= K M {lt Zx ) + C M \\^zi — TTzaH, 

< K M (ir Z2 ) + 2C m \\k Zi - 7r Z2 \\, 

< Km(tt Z2 ) + 2C M u{\zx - z 2 \). 

Therefore, if T is of the form T = £Tm(7t Zi ) + zq, we obtain by a change of variable that 
e T (^ Z2 ) < (k m (tt Z2 ) + 2C m uj{\z 1 - ,2 2 |))|T|^. 



Let /j, z G P m be the Taylor polynomial of / at the point z up to degree m. Using (2.20) 
we obtain 

e T (f) < e T (fi Z2 ) + e T (f -/x Z2 ) 
= e T (ir Z2 ) + e T (f - /i Z2 ) 

< (K M {n Z2 ) + 2C M u{\ Zl - z 2 \))\T\^ + p +e T (f-fi 



22 J 
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By the same argument as in the proof of Proposition 2.3.1 we derive that 

e T (f - aO < C\T\p diam(T) m u;(diamT), 

and thus 

e T (f) < (^M(^ 2 ) + 2C A fO;(|z 1 -^|))|T|^ + p+C|T|pdiam(T) m u;(diamT). 
Since T is the scaled version of a triangle in Tm, it obeys diam(T) 2 < M 2 \T\. Therefore, 

e T (f) < (K M (7c Z2 ) + (2C M + CM m )u(m a x{\z 1 -z 2 \,dmm(T)}))\T\^ + l, 
which is the desired inequality with Bm '■= 2Cm + CM m . o 



For some r > to be specified later, we now choose an arbitrary triangular mesh 1Z 
of Q satisfying 

r > sup diam(_R). 

Our strategy to build a triangulation that satisfies the optimal upper estimate is to use 
the triangles R as macro- elements in the sense that each of them will be tiled by a locally 
optimal uniform triangulation. This strategy was already used in |2J. 
For all R G TZ we consider the triangle 

T R := {K M {7r bR ) + 2B M u(r))-^T M (Tr bR ), 

which is a scaled version of T M {ir bR ) where bn is the barycenter of R. We use this triangle 
to build a periodic tiling Vr of the plane : there exists c6l 2 such that T R U T' R forms a 
parallelogram of side vectors a and b, with T' R = c — Tr. We then define 

Vr := {T R + ma + nb ; m,n G 7L 2 } U {T' R + ma + nb; m,n G 7L 2 }. (2.26) 

Observe that for all it G H, m and all triangles T, T' such that T' = — T, one has 
er(7r) = eT'(vr), since 7r is either an even polynomial when m is an even integer, or an odd 
polynomial when m is odd. Since we already know that e-r(vr) is invariant by translation 
of T, we find that the local error er(vr) is constant on all T G Vr. 

We now define as follows a family of triangulations T s of the domain Q, for s > 0. 
For every R G 1Z, we consider the elements T D R for T G s"Pr, where sPr denotes the 
triangulation Vr scaled by the factor s. Clearly, {T n R, T G sP/?, R & 1Z and int(T fl 
i?) 7^ 0} constitute a partition of Q, up to a set of zero Lebesgue measure. In this partition, 
we distinguish the interior elements 

77 e s : = {T G sVr ; T C int(fl) , i? G 7Z}, 

which define pieces of a conforming triangulation, and the boundary elements T fl R 
for T G sPij such that int(T) fl dR ^ 0. These last elements might not be triangular, 
nor conformal with the elements on the other side. Note that for s > small enough, 
each R G 1Z contains at least one triangle in 7^ reg , and therefore the boundary elements 
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Figure 2.1 -a. An edge (thick) of the macro-triangulation 1Z separating to uniformly 
paved regions (T R is thick, V R is dashed), b. Additional edges (dashed) are added near 
the interface in order to preserve conformity, c. The sets of triangles 7^ reg (gray) and 7^ bd 
(white) 



constitute a layer around the edges of 1Z. In order to obtain a conforming triangulation, 
we proceed as follows : for each boundary element T fl R, we consider the points on its 
boundary which are either its vertices or those of a neighboring element. We then build 
the Delaunay triangulation of these points, which is a triangulation of T n R since it is a 
convex set. We denote by 7^ bd the set of all triangles obtained by this procedure, which 



is illustrated in Figure 2.1 



Our conforming triangulation is given by 

% = r; cg u 7; bd . 

As s — > 0, clearly, 

#(7; bd ) < Cms' 1 and ^ |T| < C hd s 

TeT^ d 

for some constant Cbd which depends on the macro-triangulation 71. We do not need to 
estimate Cbd since 7Z is fixed and the contribution due to Cbd in the following estimates is 
negligible as s — > 0. We therefore obtain that the number of triangles in 7^ bd is dominated 
by the number of triangles in 77 eg . More precisely, we have the equivalence 

#(T S ) ~ #(77 g ) ~ Yl = s ™ 2 E \ R K K mM + 2B M u(r)y, (2.27) 

Ren ' R > Ren 

in the sense that the ratio between the above quantities tends to 1 as s — > 0. The right- 
hand side in (2.27) can be estimated through an integral : 



* 2 #(77 es ) < J2\ R \( K ^n) + ^BMUj(r)y 
Ren 

= f (K M (7i bR ) + 2B M u(r)ydz 

Ren J R 

- E / ( k m(k z )+C m \\ n z — 7r fcfl || + 2B M u(r)ydz 
Ren^ R 



< / (K M (n z ) + (2B M + C M )u{r)ydz. 
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Therefore, since Cm < Bm, 



#(T S ) < s- 2 (J(K M (ir g ) + 3B M u{r)) T dz + C bd s\ . (2.28) 

Observe that the construction of T s gives a bound on the diameter of its elements 

sup diam(T) < sC a , C a := maxdiam(T R ). 



TeTs 



ReTl 



Combining this with ( 2.27| ), we obtain that 



sup diam(T) < C A #(%)~ 1/2 for all s > 0, 



which is analogous to the admissibility condition (2.6). 



We now estimate the global interpolation error ||/ — I™ -1 f\\jj, := (Xlrer e T(f) p )*, assu- 
ming first that 1 < p < oo. We first estimate the contribution of 7^ bd , which will eventually 
be negligible. Denoting by v z G P m _i the Taylor polynomial of / up to degree m — 1 at 
z, we remark that 

||/-It _1 /IU-(t) = Kl-^if-^h^T) < CiUZ-zvlU-m < CoCidiam^r, 

where C\ is the norm of I — I™ -1 in L°°(T) which is independent of T, and Co only 
depends on the L°° norm of d m f. Remarking that e T (f) = \\f - l^ 1 I\\lp(t) < I^HI/ - 
It" 1 /|U°°(t)j we obtain an upper bound for the contribution of 7^ bd to the error : 



mp 



erUY < CIC{ mdiam(T) 
TeT„ bd TeT. bd 



< C£Cf l T M SU P diam(T) mp 



TeT bd 



TeT, bd 



< C^CfC bd s sup diam(T) mp 

Tg7 ^d 

with C* d = C^CfC^ p C hd . We next turn to the the contribution of T s rc& to the error. If 
T G 77 cg , T C R & 71, we consider any point z\ = z G T and define ^2 = &r the barycenter 



of -R. With such choices, the estimate (2.25) reads 



m i 1 
V 



e T (f) < (Km(tt z ) + B M oj(max{r,C A s})) \T\? 

We now assume that s is chosen small enough such that C A s < r. Geometrically, this 
condition ensures that the "micro-triangles" constituting 7^ actually have a smaller dia- 
meter than the "macro-triangles" constituting TZ. This implies 

e T (f) p < (K m (tt z ) + B M uj{r)f |T|^ +1 . (2.29) 
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Given a triangle T G 7^ rcg , T C R G 1Z, and a point z E T, one has 

|T| = S 2 (ir M (7r fe J + 2 J B M u;(r))- r 

< s 2 (K M (n z ) - C m \\tt z - n bR \\ + 2B M u(r)y T 

< s 2 (K M (ir z ) + (2B M - C M )u(r))- T . 

Observing that Bm > Cm, and that p — = r, we insert the above inequality into the 



estimate (2.29), which yields 

e T (f) p < s mp (K M (ir z ) + B M u(r)) T \T\. 
Averaging on z G T, we obtain 

e T (f) p < s mp [ (K M (ft z ) + B M cu(r)) T dz. 
Jt 

Adding up contributions from all triangles in T s , we find 

er s UT= E e ^f) P+ E ^(f) p <s mp f + £?Wr)) T <fa + C b * d s^ +1 . 



T677' 



TeT bd 



Combining this with the estimate (2.28) we obtain 



erM%)^ < U(K M (7t z )+B M u;(r)Ydz + C* M s 
and therefore, since - = tt + -, 



(K M (ft z ) + 3B M u(r)) T dz + C hd s 



\jmmv[#{T,)*er.) < (J^ (ATm(tt 2 ) + 3B A/ u;(r)) T dz J . 
It is now time to observe that for any fixed M, 

lim / (K m (tc z ) + 3B M u{r)) T dz = / K T M (iT z )dz, 
and that, since Km (ft) converges decreasingly to K(tt) := K m>p (7r) for any 7r G H r 



lim / K M (n z )dz = f K T (n z )dz. 



Therefore, for all e > 0, we can choose M sufficiently large and r sufficiently small, such 
that 

limsup(#(T s )^e rs ) < (J K T {ir z )d z y + e. 



This gives us the previously mentionned statement of Theorem 2.1.2, by defining 

s N := min{s > ; #(T S ) < N}, 



2.3. Optimal estimates 



89 



and by setting Tn = % N - 

The adaptation of the above proof in the case p = oo is not straightforward due to 
the fact that the contribution to the error of 7^ bd is no longer negligible with respect to 
the contribution of 7^ reg . For this reason, one needs to modify the construction of 7^ bd . 
Here, we provide a simple construction but for which the resulting triangulation T s is 
nonconforming, as we do not know how to produce a satisfying conforming triangulation. 

More precisely, we define 7^ reg in a similar way as for p < oo, and add to the construc- 
tion of 7^ bd a post-processing step in which each triangle is split into 4 J similar triangles 
according to the midpoint rule. Here we take for j the smallest integer which is larger 
than — 4°Qg 2 ■ With such an additional splitting, we thus have 

max diam(T) < maxdiam(sTfj) = C a s 1+ ±. 

The contribution of 7^ bd to the L°° interpolation error is bounded by 

e rbd (/) < C d max diam(T) m < CZ A s^ , 
Tel -hd 

with C* d := C0C1C7. We also have 

#(7; bd ) < c hdS -v\ 

which remains negligible compared to s~ 2 . We therefore obtain 

#(T S ) < s~ 2 U(K M (* Z ) + 3B M u(r))^dz + C bd s 1/2 ) . ( 2 . 3 0) 



Moreover, if T G 7^ reg and T C R G 71, we have according to the estimate (2.25), 

e T (f) < (K M (n bR ) + 5 M u;(max{r, C A s})) |T|? 

By construction |T| = s 2 (KM(^b R ) + 2i?Mw(r))^ 2 / m . This implies &r(f) < s m when 
Cas < r. Therefore, 

e Ts{f) = niax{e7;>'<=g, e T bd} < s m max{l, C^ d s^}. 



Combining this estimate with (2.30) yields 



limsup(#(7;)^e rs ) <( f (K M (ir z ) + 3B M u(r))™ dz 



2 



and we conclude the proof in a similar way as for p < 00. 
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2.4 The shape function and the optimal metric for 
linear and quadratic elements 

This section is devoted to linear (m = 2) and quadratic (m = 3) elements, which 
are the most commonly used in practice. In these two cases, we are able to derive an 
exact expression for K mjt (Tv) in terms of the coefficients of tt. Our analysis also gives us 
access to the distorted metric which characterizes the optimal mesh. While the results 
concerning linear elements have strong similarities with those of j4], those concerning 



quadratic elements are to our knowledge the first of this kind, although 24 analyzes a 
similar setting. 

2.4.1 Exact expression of the shape function 

In order to give the exact expression of K mp , we define the determinant of a homoge- 
neous quadratic polynomial by 

det (arc 2 + 2bxy + cy 2 ) = ac — b 2 , 

and the discriminant of a homogeneous cubic polynomial by 

disc(ax 3 + bx 2 y + cxy 2 + dy 3 ) = b 2 c 2 - Aac 3 - Ab 3 d + 18abcd - 27a 2 d 2 . 

The functions det on H2 and disc on H3 are homogeneous in the sense that 

det(ATr) = A 2 det tt, disc(ATr) = A 4 disc tt. (2.31) 

Moreover, it is well known that they obey an invariance property with respect to linear 
changes of coordinates : 

det(vr o 0) = (det 0) 2 det tt, disc(vr o 0) = (det 0) 6 disc tt. (2.32) 

Our main result relates K m>p to these quantities. 

Theorem 2.4.1. We have for all n E M2, 



-^2,p(tt) = <7 p (det7r)v| det7r|, 

and for all it G M3, 

K3,p(ir) = cr*(disc7r) vl disc7r|, 
where o~ p (t) and o~*(t) are constants that only depend on the sign oft. 



The proof of Theorem 2.4.1 relies on the possibility of mapping an arbitrary polynomial 
71 G H2 such that det(7r) 7^ or 71 G H3 such that disc(7r) 7^ onto two fixed polynomials 
7r_ or 7r + by a linear change of variable and a sign change. 

In the case of H2, it is well known that we can choose 7r_ = x 2 — y 2 and tt + = x 2 + y 2 . 
More precisely, to all tt G H 2) we associate a symmetric matrix Q n such that tt(z) = 
(Q n z,z). This matrix can be diagonalized according to 

Ai C 
A 2 



Q, = U T [ ° )U, UeG 2 , A 1; A 2 G 
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Then, denning the linear transform 



|Ax| 







I A, 



and = sign(Ai) G { — 1, 1}, it is readily seen that 

AttVT 0(j) n = 



x 2 + y 2 



if det Ti > 0, 
2 if det 7r < 0. 



ar - r 

In the case of H 3 , a similar result holds, as shown by the following lemma : 

Lemma 2.4.2. Let tt G M 3 be such that disc7r ^ 0. There exists a linear transform 
such that 

x(x 2 — 3y 2 ) if disc7r > 0, 
x(x 2 + 3y 2 ) if disc7r < 0. 



TT O 



(2.33) 



Proof: Let us first assume that tt is not divisible by y so that it can be factorized as 

X{x - T X y){x - r 2 y)(x - r 3 y), 



TT 



with A G K and G C. If disc tt > 0, then the are real and we may assume r\ < r 2 < r$. 
Then defining 

6 -Af2disc7rr 1 / 3 ^ r i( r 2 + r 3)-2r 2 r 3 (r 2 - r 3 )nV3 

an elementary computation shows that tt o </>„. = x(x 2 — 3y 2 ). If disc7r < 0, then we 
may assume that r% is real and r 2 and r 3 are complex conjugates with Im(r 2 ) > 0. Then 
defining 



A(2 disc7r) 



-1/3 



ri(r 2 + r 3 ) - 2r 2 r 3 i(r 2 - r 3 )r i v / 3 
2ri-(r 2 + r 3 ) i(r 2 - r 3 )\/3 

an elementary computation shows that i\o^ v = x(x 2 + 3y 2 ). Moreover, it is easily checked 
that 07,- has real entries and is therefore a change of variable in IR 2 . 

In the case where 7r is divisible by y, there exists a rotation U G (9 2 such that tt := noU 
is not divisible by y. By the invariance property (2.32) we know that disc7r = disc7f. Thus, 
we reach the same conclusion with the choice <b m 



Uo 



o 



Proof of Theorem |2.4.1| For all n G H 2 such that det n and for all change of 
variable and A ^ 0, we may combine the properties of the determinant in (2.31) and 
(2.32) with those of the shape function established in Proposition 2.2.3 This gives us 

K 2 , p (ir) K 2t p(\n o <p) 



Applying this with 



■v/l det 7r| v/|det(A7To0)|' 
V and A = X n , we therefore obtain 



K. 



2.p 



TT 



TT 



^2, P (x 2 + y 2 ) 
K 2 , p {x 2 - y 2 ) 



if det 7T > 0, 
if det n < 0. 
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This gives the desired result with <J p (t) = K 2tP (x 2 + y 2 ) for t > and o~ p (t) = K 2)P (x 2 — y 2 ) 
for t < 0. In the case where det7r = 0, tt is of the form ir(x,y) = \(ax + (3y) 2 , and we 
conclude by Proposition 2.2.1 that K 2jP (tt) 



0. 



For all 7r G H 3 such that disc n ^ 0, a similar reasoning yields 



K 



3,p 



7T 



disc vr 108 



-^3,p(^(^ 2 - 3y 2 )) if disc7r > 0, 
K 3 , p (x(x 2 + 3y 2 )) if discvr < 0, 



where the constant 108 comes from the fact that disc(x(x 2 — 3y 2 )) 
108. This gives the desired result with a*{t) = lQ%~±K^ p (x(x 2 
cr*(t) = 108^2 K 3:P (x(x 2 + 3y 2 )) for t < 0. In the case where disc7r 



— disc(x(x 2 — 3y 2 )) = 
3y 2 )) for t > and 
= 0, it is of the form 



7r(x, y) = {ax + j3y) I (7a; + fo/), and we conclude by Proposition 2.2.1 that i^3 jP (7r) = 0. o 



Remark 2.4.3. We do not know any simple analytical expression for the constants in- 
volved in a p and a*, but these can be found by numerical optimization. These constants 
are known for some special values of p in the case m = 2, see for example 



2.4.2 Optimal metrics 



Practical mesh generation techniques such as in 15 , 19 66 , 92 , 94 are based on the 



data of a Riemannian metric, by which we mean a field h of symmetric definite positive 
matrices 

x G Q h-> h(x) G S£. 

Typically, the mesh generator takes the metric h as an input and hopefully returns a 
triangulation Th adapted to it in the sense that all triangles are close to equilateral of unit 



side length with respect to this metric. Recently, it has been rigorously proved in 15,85 
see also Chapter 5, that some algorithms produce bidimensional meshes obeying these 
constraints under certain conditions. This must be contrasted with algorithms based on 



heuristics, such as [92] in two dimensions and 94 in three dimensions, which have been 
available for some time and offer good performance 20 but no theoretical guaranties. 
See (56| for a review of these mesh generation techniques. 

For a given function / to be approximated, the field of metrics given as input should 
be such that the local errors are equidistributed and the aspect ratios are optimal for the 
generated triangulation. Assuming that the error is measured in X = LP and that we are 
using finite elements of degree m — 1, we can construct this metric as follows, provided 
that some estimate of n z = d ^\ z ^ is available for all points z G Vt. An ellipse E z such that 
|^| is equal or close to 



sup 



\E\ 



(2.34) 



is computed, where A 7fz is defined as in (2.14). We denote by h 7Tz G SP the associated 
symmetric definite positive matrix such that 



E z = {{x,y) ; {x,y) T K z (x,y) < 1} 
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Let us notice that the supremum in (2.34) might not always be attained or even be finite. 
This particular case is discussed in the end of this section. Denoting by v > the desired 
order of the LP error on each triangle, we then define the metric by rescaling h- Kz according 
to 

1 y _ 1 

h(z) = —h w where a z := u m p+ 2 \E z \ ™p+2. 
Or z 

z 

With such a rescaling, any triangle T designed by the mesh generator should be compa- 
rable to the ellipse z + a z E z centered around z the barycenter of T, in the sense that 



z + c\ot z E z C T C z + c 2 a z E z (2.35) 

for two fixed constants < 2ci < c 2 independent of T (recall that for any ellipse E there 
always exists a triangle T such that £cTc 2E). 

Such a triangulation heuristically fulfills the desired properties of optimal aspect ratio 
and error equidistribution when the level of refinement is sufficiently small. Indeed, we 
then have 



e m ,T{f) P ~ e miT (vr z ) p 

= |Kz - It~ Kz\\lp(T), 
1 , 
\p\\n z -L T vr 2 || L oo( T ), 

i 

~ \T\f \\ir z \\L°°(T), 
i 

~ \OL z E z \p 1 1 loo ( aa ^), 
m+- l 

= a z p \E z \p\\7r z \\ L °°( Ez ), 



where we have used the fact that tv z e H m is homogeneous of degree m. 

Leaving aside these heuristics on error estimation and mesh generation, we focus on 
the main computational issue in the design of the metric h(z), namely the solution to the 
problem (2.34) : to any given n G H m , we want to associate h n G S 1 ^ such that the ellipse 
E w defined by h n has area equal or close to swp Ee£ EcAv \E\. 

When m = 2 the computation of the optimal matrix h n can be done by elementary 
algebraic means. In fact, as will be recalled below, h n is simply the absolute value (in the 
sense of symmetric matrices) of the symmetric matrix [n] associated to the quadratic form 
7i. These facts are well known and used in mesh generation algorithms for P x elements. 

When m > 3 no such algebraic derivation of from tt has been proposed up to now, 
and current approaches instead consist in numerically solving the optimization problem 



(2.16), see 23 . Since these computations have to be done extremely frequently in the 
mesh adaptation process, a simpler algebraic procedure is highly valuable. In this section, 
we propose a simple and algebraic method in the case m — 3, corresponding to quadratic 
elements. For purposes of comparison, the results already known in the case m = 2 are 
recalled. 
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Figure 2.2 - Maximal ellipses inscribed in A n , tt = x(x 2 — 3y 2 ) or tt 



x\x 



+ 3y 2 ). 



Proposition 2.4.4. 1. Letir G Mi be such that det(w) ^ 0, and consider its associated 
2x2 matrix which can be written as 



M = t/ T ( 



Ai 
A 2 



U, Ue 2 . 

Then an ellipse of maximal volume inscribed in A n is defined by the matrix 



hr 







( 


|A 2 | J 



u. 



2. Let tt G M% be such that disc7r > ; and let (f) n be a matrix satisfying (2.33). Define 

j.-l\T i-l 



) 



(2.36) 



Then h n defines an ellipse of maximal volume inscribed in A n . Moreover, deth v 
£ -^— (disc IT) 3 . 



3. Let tt G Ms be such that disc7r < ; and (j) v a matrix satisfying (2.33). Define 



K = 23| 



-hT i-l 



Then h n defines an ellipse of maximal volume inscribed in A w . Moreover, deth^ = 
1 1 disc TT | 3 . 

Proof: Clearly, if the matrix h„ defines an ellipse of maximal volume in the set A,,-, 
then for any linear change of coordinates 0, the matrix (0 _1 ) T /?, 7r 0~ 1 defines an ellipse of 
maximal volume in the set A^o^. When tt G H2, we know that X n TT o tfi^ = x 2 + y 2 when 
det tt > 0, and x 2 — y 2 when det7r < 0, where jA^I = 1. When tt G H3, we know from 
that TTO(j) n = x{x 2 — 3y 2 ) when disc tt > 0, and x(x 2 + 3y 2 ) when disc tt < 0. 



2.4.2 



y , x(x — 3y )}, then 



Lemma 

Hence it only remains to prove that when tt G {x 2 + y 2 , x 2 
h n = Id, which means that the disc of radius 1 is an ellipse of maximal volume inscribed 
in A,,-, while when tt = x(x 2 + 3y 2 ) we have h n = 2 1//3 Id. 

The case tt = x 2 + y 2 is trivial. We next concentrate on the case tt = x(x 2 + 3y 2 ), the 
treatment of the two other cases being very similar. Let E be an ellipse included in A,,., 
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tt = x(x 2 + 3y 2 ). Analyzing the variations of the function 7r(cos#, sin#), it is not hard to 
see that we can rotate E into another ellipse E', also satisfying the inclusion E' C A„., 
and whose principal axes are {x = 0} and {y = 0}. We therefore only need to consider 
ellipses of the form kx 2 + hy 2 < 1. For a given value of h, we denote by k(h) the minimal 
value of k for which this ellipse is included in A w . Clearly, the boundary of the ellipse, 
defined by k{h)x 2 + hy 2 = 1, must be tangent to the curve defined by ir(x, y) = 1 at some 
point (x, y). This translates into the following system of equations : 

n(x,y) = 1, 

hx 2 + ky 2 = 1, (2.37) 

kyd x n(x,y) - hxd x ir(x,y) = 0. 

Eliminating the variables x and y from this system, as well as negative or complex-valued 
solutions, we find that k(h) = when h G (0, 2], and k(h) = k(2) = 1 when h > 2. The 
minimum of the determinant hk{h) — | (| + /i 2 ) is attained for h — 23. Observing that 
k(2s) = 25, we obtain, as previously stated, h w = 2 1 / 3 Id and that the ellipse of l arge st 



area included in A^ is the disc of equation 2 1 ^ 3 (x 2 + y 2 ) < 1, as illustrated on Figure 2. 2 b. 



The same reasoning applies to the other cases. For tt = x 2 — y 2 we obtain k(h) = h, 
h G (0,oo). In this case the determinant hk(h) is independent of h, and we simply 
choose h = 1 = k(l). For ir = x[x 2 — 3y 2 ) we obtain k(h) = when h G (0,1] 

and k(h) = k(l) = 1 when h > 1. The maximal volume is attained when h = 1, corres- 



ponding to the unit disc, as illustrated on Figure 2.2 a. o 



Remark 2.4.5. When tt G M 3 and disc7r > a surprising simplification happens : 
the matrix (12.361) has entries which are symmetric functions of the roots ri,r 2 ,r 3 . Using 



the relation between the roots and the coefficients of a polynomial, we find the following 
expression : if tt = ax 3 + 3bx 2 y + 3cxy 2 + dy 3 , then 



fin = 2 3 3 (disc 7r) 



=1 ( 2(b 2 - ac) 
be — ad 



bc- 
2(c 2 



ad 
-bd) 



This yields a direct expression of the matrix as a function of the coefficients. Unfortunately, 
there is no such expression when disc7r < 0. 



At first sight, Proposition 2.4.4 might seem to be a complete solution to the problem 
of building an appropriate metric for mesh generation. However, some difficulties arise 
at points z G fl where det7r 2 = or disc7r 2 = 0. If tt G H 2 \ {0} and det7r = 0, then 
up to a linear change of coordinates, and a change of sign, we can assume that tt = x 2 . 
The minimization problem clearly yields the degenerate matrix h n = diag(l, 0), the 2x2 
diagonal matrix with entries 1 and 0. If tt G H3 \ {0} and disc7r = 0, then up to a linear 
change of coordinates either tt = x 3 or tt = x 2 y. In the first case the minimization problem 
gives again h n = diag(l,0). In the second case a wilder behavior appears, in the sense 
that minimizing sequences for the problem (2.34) are of the type h n = diag(e _1 ,e 2 ) with 



e — > 0. The minimization process therefore gives a matrix which is not only degenerate, 
but also unbounded. 
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These degenerate cases appear generically and constitute a problem for mesh genera- 
tion since they mean that the adapted triangles are not well defined. Current anisotropic 
mesh generation algorithms for linear elements often solve this problem by fixing a small 
parameter 5 > and working with the modified matrix h n := h n + 5 Id which cannot 
degenerate. However, this procedure cannot be extended to quadratic elements, since h x i y 
is both degenerate and unbounded. 

In the theoretical construction of an optimal mesh which was discussed in £ 2.3.2 we 
tackled this problem by imposing a bound M > on the diameter of the triangles. This 
was the purpose of the modified shape function K m (ti) and of the triangle T M (ir) of 
minimal interpolation error among the triangles of diameter smaller than M. We follow 
a similar idea here, looking for the ellipse of largest area included in with constrained 
diameter. This provides matrices which are both positive definite and bounded and which 
vary continuously with respect to the data tt G H3. A similar construction is presented 



in £ 6.5.3 of Chapter 6 in arbitrary degree m and dimension d, however the expression 
of the matrix in terms of the polynomial tt is less explicit in that general context. The 
constrained problem, depending on a > 0, is the following : 

sup{|£| ; E G S, E C A n and diam£ < 2oT 1/2 }, (2.38) 

or equivalently, 

inf{det# ; H G S 2 + s.t. (Hz,z) > \n(z)\ 2/m ,z G M 2 , and H > aid}. (2.39) 



We denote by E^^ and h Wja the solutions to (2.38 ) and (2.39 ) respectively. In the remainder 



of this section, we show that this solution can also be computed by a simple algebraic 
procedure, avoiding any kind of numerical optimization. In the case where tt G H 2 , it can 
easily be checked that 

[K. a } = U-( max{ ^ a} ° , )u, (2.40) 

V max{|A 2 |, a\, J v 7 



as illustrated in Figure 2.3 



When 7r G H 3 , the problem is more technical, and the matrix h W:a takes different forms 
depending on the value of a and the sign of disc7r. In order to describe these different 
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regimes, we introduce three real numbers < < a w < //^ and a matrix U w G O2 which 
are defined as follows. We first define fi n by 

^■.^mmQzW ; \ir(z)\ = 1}, 

the radius of the largest disc D n inscribed in A,,.. For z v such that 1 7r ( ) | = 1 and 

1/2 

II £71- II = Utt , we define U n as the rotation which maps z n to the vector (H^H, 0). We then 
define a n by 

2a^ 1/2 := max{diam(£) ; E G 8 ; D n C E C A J, 

the diameter of the largest ellipse inscribed in A^ and containing the disc D n . In the case 
where it is of the form (ax + by) 3 , this ellipse is infinitely long, and we set a n = 0. We 
finally define (3 n by 

2/3 7 7 1 / 2 :=diam(£ 7r ), 

where E n is the optimal ellipse described in Proposition 2.4.4| In the case where disc n = 0, 
the "optimal ellipse" is infinitely long, and we set j5 % = 0. It is readily seen that < (3 n < 

All these quantities can be algebraically computed from the coefficients of it by solving 
equations of degree at most 4, as well as the other quantities involved in the description 
of the optimal h^^ and E n<Q in the following result. 

Proposition 2.4.6. For it G M 3 and a > 0, the matrix h^^ and ellipse E n>a are described 
as follows : 

1. If a > Hk, then h n>a = aid and E a ^ is the disc of radius or 1 ! 2 . 

2. If a n < en < u. n , then 

K, a = U?(^ l^jU^ (2.41) 

and E a is the ellipse of diameter 2a -1 / 2 which is inscribed in A n and contains D^. 
It is tangent to dA n at the two points z^ and —z n . 

3- If fin < 01 < a w then E Wj0l is tangent to dA^ at four points and has diameter 2a~ 1 / 2 . 
There are at most three such ellipses, and E Wt0l is the one of largest area. The matrix 
h Wja has a form which depends on the sign of disc it. 
(i) If disc 7T < 0, then 



U 3A2 



where (f) n is the matrix defined in Proposition 2.4.4 and X a is determined by det(/i 7r>Q ,— 
aid) = 0. 

(ii) If disc 7T > 0, then 
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Figure 2.4 - The set A,, (solid), the disc E n ^ = D w (solid), the ellipse E ni(3hr (solid), 
and the ellipses E^ >a (dashed) for various values of a > when tt G H 3 and a G (a w , oo). 
Left : disc7r < 0. Right : disc7r > 



Figure 2.5 - The set A^ (solid), the ellipse E^a* (solid), the ellipse E^,^ = E n (solid), 
and the ellipses E n>a (dashed) for various values of a > when it G H 3 and a G (f3 n , a n ). 
Left : disc7r < 0. Right : disc7r > 

where and X a are given as in the case disc tt < and where V is chosen between 
the three rotations by 0, 60 or 120 degrees so as to maximize \E a ^\. 
(Hi) If disc 7r = and a n > 0, then there exists a linear change of coordinates <ft 
such that tt o cj) = x 2 y and we have 



i-l\T 



K o 
o 



27X1 



where X a is determined by det(h n ^ a — aid) = 0. 

4- If a < fin, then h nta = h w , and E n ^ a = E m is the solution of the unconstrained 
problem. 

Proof: See Appendix. o 



Figure 2.4 illustrates the ellipses E^ ta , a G (a*, oo) when disc7r > ( 2.4 



2.4 b). Figure 



2.5 illustrates the ellipses E^ >a , a G (/3 n ,a w ) when disc7r > ( 2.5 



or disc 7r < 
a) 



or 



disc7r < ( 2.5 b). Note that when a > a W} the principal axes of E w<a are independent 
of a, since U n is a rotation that only depends on tt, while these axes generally vary when 
Ar < ct < a^, since the matrix (p n is not a rotation. 



2.5. Polynomial equivalents of the shape function in higher degree 
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Remark 2.4.7. For interpolation by cubic or higher- degree polynomials (m > 4), an 
additional difficulty arises that can be summarized as follows : one should be careful not 
to "overfit" the polynomial it with the matrix h n . An approach based on exactly solving 



the optimization problem (2.34) might indeed lead to a metric h(z) with unjustified strong 
variations with respect to z and/or bad conditioning, and jeopardize the mesh generation 
process. As an example, consider the one-parameter family of polynomials 

n t = x 2 y 2 + ty 4 G H A , te [-1,1]. 

It can be checked that when t > 0, the supremum S + = swpEes,EcA n \E\ is finite and 
independent of t, but not attained, and that any sequence E n C of ellipses such that 
lim n _ 5 . 00 \E n \ = S + becomes infinitely elongated in the x direction as n — >• oo. For t < 0, 
the supremum SL = sup Egf ecK l-^l ^ s independent oft and attained for the optimal 

ellipse of equation \t\~ 1 / 2 ^ 2 ~ 1 x 2 + |t| 1//2 ?/ 2 < 1. This ellipse becomes infinitely elongated 
in the y direction as t — > 0. This example shows the instability of the optimal matrix 
h n with respect to small perturbations of n. However, for all values of t G [—1,1], these 
extremely elongated ellipses could be discarded in favor, for example, of the unit disc 
D = {x 2 + y 2 < 1}, which obviously satisfies D C A„- t and is a near-optimal choice in the 



sense that 2\D\ = S+ < S- = \D\J2(y/2 + l) 



2.5 Polynomial equivalents of the shape function in 
higher degree 

In degrees m > 4, we could not find analytical expressions of K m ^ p or K m and do 
not expect them to exist. However, equivalent quantities with analytical expressions are 
available, under the same general form as in Theorem 2.4.1 : the root of a polynomial in 



the coefficients of the polynomial 7r G H m . This result improves on the analysis of 25 
where a similar setting is studied. 

In the following, we say that a function R is a polynomial on H m if there exists a 
polynomial P of m + 1 variables such that for all (a , • • • , a m ) G M m+1 , 



, i=0 

and we define degR := degP. 

The object of this section is to prove the following theorem : 

Theorem 2.5.1. For all degree m > 2, there exists a polynomial K m on Hf m , and a 
constant C m > such that for all it G M m and all 1 < p < oo, 



^- VK m (7T) < K m ^Tl) < C m VK m (7T) 



where r m = degK m . 
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Since for fixed m all functions K miP , 1 < p < oo, are equivalent on H m , there is 
no need to keep track of the exponent p in this section, and we use below the notation 
K m = K m>00 . In this section, the reader should not confuse the functions K m and K m , nor 
the polynomials Qd and Qd below, which notations are only distinguished by their case. 

Theorem 2.5.1 is a generalization of Theorem 2.4.1 , and the polynomial K m involved 
should be seen as a generalization of the determinant on IH 2 and of the discriminant on 
H3. Let us immediately stress that the polynomial K m is not unique. In particular, we 
shall propose two constructions that lead to different K m with different degree r m . Our 
first construction is simple and intuitive but leads to a polynomial of degree r m that grows 
quickly with m. Our second construction uses the tools of invariant theory to provide a 
polynomial of much smaller degree, which might be more useful in practice. 

We first recall that there is a strong connection between the roots of a polynomial in 
H 2 or H 3 and its determinant or discriminant : 



, r 2 ) 2 . 
4 1 



det I A J] (x-nyU = ^A 2 (r x 

\ l<i<2 J 

disc A Y\ ( x - r iV) = A 4 (ri - r 2 ) 2 (r 2 - r 3 ) 2 (r 3 - r x ) 2 . 



Ki<3 



We now fix an integer m > 3. Observing that these expressions are a "cyclic" product 
of the squares of differences of roots, we define 



§(A, r 1: ■ ■ ■ , r m ) := A (n - r 2 ) • • • (r m _i - r m ) (r m - n) . 

Since m > 3, this quantity is not invariant anymore under reordering of the ?v For any 
positive integer d, we introduce the symmetrized version of the empowers of the cyclic 
product 

Q d (X,n, ■■■ ,r m ) := ^2 s (^ r ffn ' ' ' i r * m ) d i 
where S m is the set of all permutations of {1, • • • , m}. 

Proposition 2.5.2. For all d > there exists a homogeneous polynomial of degree 4d 
on M m , "with integer coefficients, and such that : 

Ifn = \ Y[(x - r { y), then Q d (vr) = Q d (X, r u ■ ■ • , r m ). 

i=l 

In addition, Qd obeys the invariance property 

Q d (7ro0) = (det0) 2mrf Q d (7r). (2.42) 
Proof: We denote by <7; the elementary symmetric functions in the r^, in such way that 

m 

l[(x - ny) = x m - o x x m - l y + a 2 x m - 2 y 2 - ■ ■ ■ + {-l) m a m y m . 

i=l 
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A well-known theorem of algebra (see, e.g., Chapter IV.6 in [67]) asserts that any symme- 
trical polynomial in the r t can be reformulated as a polynomial in the ex,. Hence for any 
d there exists a polynomial Qd such that 

Q d (l, ri, • • • , r m ) = Q d ((Ti, ■■■ , <7 m ). 

In addition, it is known that the total degree of Qd is the partial degree of Qd in the 
variable r%, in our case 4d, and that Qd has integer coefficients since Qd has. 

Given a polynomial tt G H m not divisible by y, we write it under the two equivalent 
forms : 

m 

n = a x m + a x x m ~ x y + ■■■ + a m y m = A - ny). 



Clearly a = A and <jj = (— 1)*^. It follows that 



Q d (X, n, - • ■ ,r m ) = A Q d (a u ■ ■ ■ , a m ) = a Q 



-ai (-l) m a r , 



a a 



Since deg Qd = 4d, the negative powers of a due to the denominators are cleared by the 
factor ciQ d , and the right-hand side is thus a polynomial in the coefficients a , • • • , a m that 
we denote by Qd(7r). 

We now prove the invariance of with respect to linear changes of coordinates ; this 
proof is adapted from [61]. By continuity of Qd, it suffices to prove this invariance property 
for pairs (tt, <ft) such that cf) is an invertible linear change of coordinates and neither tt or 
tt o -1 is divisible by y. 

Under this assumption, we observe that if tt = A YliLi ( x ~~ r iV) aric ^ 4> = ( ^ <J y ' 

then tt o cf)^ 1 = A n^il 3 - — ^y), where 

ari + (3 



A = A(det0)- m TT( 7 + 5r l ) and ., 

i=l 



Observing that 

det i 



U - Tj = —t- r^\ r i ~ r j), 



('yn + 5)(^rj + 5) 
it follows that 

S(A,fi,--- ,f m ) = (det0)- 2m §(A,r 1 



The invariance property (2.42) follows readily. 

We now define r m = 21cm{degQrf ; 1 < d < ml}, where lcm{ai,--- , a*,} stands for 
the lowest common multiple of {a\, ■ ■ ■ , a{}, and we consider the following polynomial on 
Hi™ : 



7.! 



K m :=^Q 



deg Q d 

d 

d=l 



Clearly, K m has degree r m and obeys the invariance property K m (7ro0) = (det 0) 2 K m (7r). 
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Lemma 2.5.3. Let n G LH m . LfK m (ir) = 0, then K m (ir) = 0. 

Proof: We assume that K m (7r) ^ and intend to prove that K m (7r) ^ 0. Without loss 
of generality, we may assume that y does not divide n, since K m (n o JJ) = K m {ji) and 
K TO (7ro[7) = K m (7r) for any rotation U. We thus write 7r = A YYiLi( x ~ r iU) > where r« G (D. 



Since K m (Tr) ^ 0, we know from Proposition 2.2.1 that there is no group of s m := [yj + 1 
equal roots ?y 

We now define a permutation a* G S m such that r a ,^ ^ r CT *(j + i) for 1 < z < m — 1 
and r CT *( m ) ^ r CT *n). In the case where m = 2m' is even and m! of the are equal, any 
permutation a* such that r CT *(i) = r CT *(3) = ■ ■ ■ = r a *(2 m '-i) satisfies this condition. In all 
other cases let us assume that the are sorted by equality : if i < j < k and = r^, then 
Ti = Tj = rfc. If m = 2m' is even, we set a*(2i — 1) = i and a*(2i) = m' + i, 1 < i < m! . If 
m = 2m'+l is odd we set a*(2i) = i, 1 < i < m! and a*(2i — 1) = m'+i, 1 < i < m'+l. For 
example, a* = (4 1 5 2 6 3 7) when m = 7 and a* = (1 5 2 6 3 7 4 8) when m = 8. With such 
a construction, we find that \a*(i)—a*(i+l)\ >m'iim is odd and \a*(i)—a*(i+l)\ > m'—l 
if m is even, for all 1 < i < m, where we have set a*(m + 1) := cr*(l). Hence a satisfies 
the required condition, and therefore S(A,r (7 *( 1 ), • • • , r CT *( m )) 7^ 0. 



It is well known that if k complex numbers a>i, ■ ■ ■ , G (D are such that o^ + - • -+a 



k 



0, for all 1 < d < A;, then ot\ — • • • — ot^ = 0. Applying this property to the m! complex 
numbers S(A,r (T ( 1 ), • • • ,r CT ( m )), a G S m and noticing that the term corresponding to a* is 
non zero, we see that there exists 1 < d < m\ such that Qd(7r) = Qd(A,ri, • • • ,r m ) 7^ 0. 
Since has real coefficients, the numbers Qd(7r) are real. Since the exponent r m J degQd 
is even, it follows that K m (7r) > 0, which concludes the proof of this lemma. o 



The following proposition, when applied to the function K c „ = rr ^/K. m , concludes the 



proof of Theorem 2.5.1 



Proposition 2.5.4. Let m > 2, and let K eq : LH m — > R + be a continuous function obeying 
the following properties : 

1. Invariance property : K cq (n o 0) = | det K eq (7r) . 

2. Vanishing property : for all ti G LH m , if K cq {it) = 0, then i^ m (vr) = 0. 
Then there exists a constant C > such that ^K eq < K m < CK eq on LH m . 

Proof: We first remark that K eq is homogeneous in a similar way as K m : if A > 0, then 
applying the invariance property to (j) = A« Id yields K cq (n o (A™ Id)) = i^ e q(A7r) and 
I det cj)\~2 = A. Hence i^ e q(A7r) = \K eq (7r). 

Our next remark is that a converse of the vanishing property holds : if K m (7i) = 0, 
then there exists a sequence 4> n of linear changes of coordinates, det <j) n = 1, such that 
7r o cj) n — y as n — > 00. Hence K eq (ii) = K eq (ir o ra ) — y K eq (0). Furthermore, K eq (0) = 
by homogeneity. Hence i^ cq (7r) = 0. 

We define the set NF m := {n G H m ; K m {ix) = 0}. We also define a set A m C H m 
by a property "opposite" to the property defining NF m . A polynomial n G H m belongs to 
A m if and only if 

||7r|| < \\ir o M for all 6 such that detd) = 1. 
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The sets NF m and A m are closed by construction, and clearly NF m fl A m = {0}. We now 
denote by K m the lower semi continuous envelope of K m , which is defined by 

K m (-n) = lim inf K m (ir') 

r— >0 \\tt' — Tr\\<r 

the lower semi-continuous envelope of K m . If K m (7r) = 0, then there exists a converging 



sequence ir n — y tt such that K m (7T n ) — >■ 0. According to Proposition 2.2.4, it follows that 
K m (Tr) = and hence 7r G iVF m . Therefore, the lower semi-continuous function K m and 
the continuous function K eq are bounded below by a positive constant on the compact set 
{n G A m , \\ir\\ = 1}. Since in addition K eq is continuous and K m is upper semi- continuous, 
we find that the constant 



C = sup max 



K eq (n) K m (ir) 



7r6A m ,|j7r||=l L Krni.^) ' K Bq {Tz) 

is finite. By homogeneity of K m and K eq , we infer that on A m , 

^K eq <Kn<K m < CK cq . (2.43) 

Now, for any 7r G H m , we consider tt of minimal norm in the closure of the set {tt o 
; det0 = 1}. By construction, we have n G A m , and there exists a sequence n , 
det0 n = 1 such that tt o n — y ft as n — > oo. If 7r = 0, then K m (ir) = K eq (7r) = 0. 
Otherwise, we observe that 

-KmQr) < ifm(7r) < ^m(Tr) and if eq (7r) = K eq (7r), 
where we used the fact that K m , K m , and K eq are respectively lower semi-continuous, 



upper semi-continuous, and continuous on H m . Combining this with inequality (2.43) 



concludes the proof. o 
A natural question is to find the polynomial of smallest degree satisfying Theorem 



2.5.1 This leads us to the theory of invariant polynomials introduced by Hilbert 61 
(we also refer to [52] for a survey on this subject). A polynomial R on H m is said to be 
invariant if jj = m ^ gR is a positive integer and for all it G H m and any linear change of 
coordinates 0, one has 

i?(7r o(f)) = (det <pYR(n) . (2.44) 

We have seen for instance that K m and are "invariant polynomials" on HI m . 

Nearly all the literature on invariant polynomials is concerned with the case of complex 
coefficients, both for the polynomials and the changes of variables. It is known in particular 



52 that for all m > 3, there exists m — 2 invariant polynomials Ri, ■ ■ ■ R m -2 on H m such 
that for any 7r (complex coefficients are allowed) and any other invariant polynomial R 
on H m : 

If #i(tt) = • • • = i? m _ 2 (7r) = 0, then R(ir) = 0. (2.45) 
A list of such polynomials with minimal degree is known explicitly at least when m < 8, 



and they have real coefficients. Defining r = 21cm(degi?j) and K eq := \JY1T=i % we 
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see that K eq (ir 



implies K m (7r) = 0, and hence K m (ir) = 0. According to proposition 
2.5. 4[ we have constructed a new, possibly simpler, equivalent of K m . 

For example, when m = 2 the list is reduced to the polynomial det, and for m = 3 
to the polynomial disc. For m = 4, given tt 
consists of the two polynomials 



ax 



+ Abx y + 6cx y + Adxy + ey , the list 



ae 



46d + 3c 2 , J 



a b c 
bed 
c d e 



therefore K^ir) is equivalent to the quantity y\I(n)\ 3 + J(it) 2 . As m increases, these 
polynomials unfortunately become more and more complicated, and their number m — 2 
obviously increases. According to [52], for m = 5 the list consists of three polynomials of 
degrees 4, 8, 12, while for m = 6 it consists of 4 polynomials of degrees 2, 4, 6, 10. 



2.6 Extension to higher dimension 



The function K m p can be generalized to higher dimension d > 2 in the following way. 
We denote by H m ^ the set of homogeneous polynomials of degree m in d variables. For 
all d-dimensional simplex T, we define the interpolation operator I™ -1 acting from C°(T) 
onto the space P m _i j( 2 of polynomials of total degree m — 1 in d variables. This operator 
is defined by the conditions I™ -1 ^(7) = v (7) for all points 7 G T with barycentric coordi- 
nates in the set {0, — ^r> ~~ ~p ' ' ' 5 !}• Following Section £2.1.2, and generalizing definition 



(2.2) 



-1' m-l' 

we define the local interpolation error on a simplex, the global interpolation error 



on a mesh, and the shape function. 
For all tt G H m>d , 

-Km,p,d(7r) 



inf 

m=i 



TT 



jm- 
1 T 



7T 



Pi 



where the infimum is taken on all <i-dimensional simplices T of volume 1. The variant KjL 



introduced in (2.15) also generalizes in higher dimension and was introduced by Weiming 



Cao in 23 . Denoting by Ed the set of d-dimensional ellipsoids, we define 



K 



m.d 



7T 



sup 



\E\/\B\ 



inf{(detM)^ ; M G S+ and \fz G R d , (Mz, z) > \tt{z)\^}, 



{ze 



\it{z)\ < 1}, B := {z G 



where A n : = 

and where St denotes the set of symmetric positive definite d x d matrices. Similarly to 



< 1} is the unit euclidean ball, 



Proposition 
equivalent : 



2.2.5 



it is not hard to show that the functions K~ 



and K Ld [ 



there exist constants < c < C depending only on m, d, such that 



7r are 



rK z < K 

ul ^m,d — ly m,p,d 



< CK £ m4 . 



Let (Tn)n>o be a sequence of simplicial meshes (triangles if d = 2, tetrahedrons if d = 3, 
. . . ) of a bounded d-dimensional, polygonal open set Q. Generalizing (2.6), we say that 
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N)N>N Q 



is admissible if #(7at) < and if there exists a constant Ca satisfying 



sup diam(T) < C A N- l/d . 
tsTjv 



The lower estimate in Theorem |2 . 1 . 2| can be generalized, with straightforward adaptations 
in the proof. If / G C m (fl) and (7~n)n>n is an admissible sequence of simplicial meshes, 
then 

'd m f 



liminf N * e m j N {f) v > 



7V-s>oo 



K 



m,d,p 



ml 



(2.46) 



where 



in 
d 



+ 



The upper estimate in Theorem |2.1.2| however does not generalize. The reason is that 
we used in its proof a tiling of the plane consisting of translates of a single triangle and 
of its symmetric with respect to the origin. This construction is not possible anymore in 
higher dimension, for example it is well known that one cannot tile the space R 3 , with 
equilateral tetrahedra. 



The generalization of the second part of Theorem 2.1.2 is therefore the following. 
For all m and d, there exists a constant C = C(m,d) > such that for any bounded 
polygonal open set Q C M. d and / G C m (Q) the following holds : for all e > 0, there exists 
an admissible sequence (J~n)n>n °f simplicial meshes Q, possibly non conforming, such 
that 

'd m f 



limsupA^de miTiv (/) p < C 



K„ 



m,d,p 



ml 



+ e. 



The "tightness" of Theorem 2.1.2| is partially lost due to the constant C. This upper bound 
is not new and can be found in |23|. In the proof of the bidimensional theorem, we define 



by (2.26) a tiling Vr of the plane made of a triangle Tr and some of its translates and 



of their symmetric with respect to the origin. In dimension d, the tiling Vr cannot be 
constructed by the same procedure. The idea of the proof is to first consider a fixed tiling 
Vq of the space constituted of simplices bounded diameter, and of volume bounded below 
by a positive constant, as well as a reference equilateral simplex T eq of volume 1. We then 
set Vr = 4>(Vo), where is a linear change of coordinates such that Tr = 0(T eq ). This 
procedure can be applied in any dimension and yields all subsequent estimates "up to a 
multiplicative constant," which concludes the proof. 

Since this upper bound is not tight anymore, and since the functions K m)P ^ are all 
equivalent to K^ nd as p varies (with equivalence constants independent of p), there is 
no real need to keep track of the exponent p. We therefore denote by K m>d the function 

-K m ,oo,d' 

For practical as well as theoretical purposes, it is desirable to have an efficient way 
to compute the shape function K m ^, and an efficient algorithm to produce adapted tri- 
angulations. The case m — 2, which corresponds to piecewise linear elements, has been 
extensively studied, see for instance [4,27 . In that case there exists constants < c < C, 
depending only on d, such that for all 71 G H^d, 



!</|detvr| < K 24 (tt) < C</| det?r| 
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where det ir denotes the determinant of the symmetric matrix associated to ir. Further- 



more, similarly to Proposition 2.4.4 the optimal metric for mesh refinement is given by 
the absolute value of the matrix of second derivatives, see |4j[27], which is constructed in 
a similar way as in dimension d = 2 : with U and D = diag(Ai, • • • , Xd) the orthogonal 
and diagonal matrices such that [n] = U T DU and with \D\ := diag(|Ai|, ■ ■ ■ , \Xd\), we set 
h n = £/ T |.D|[/. It can be shown that the matrix h n defines an ellipsoid of maximal volume 
included in the set A^. The case m = 2 can therefore be regarded as solved. 

For values (m, d) both larger than 2, the question of computing the shape function as 
well as the optimal metric is much more difficult, but we have partial answers, in particular 
for quadratic elements in dimension 3. Following §2.5[ we need fundamental results from 
the theory of invariant polynomials, developed in particular by Hilbert |61i|. In order to 
apply these results to our particular setting, we need to introduce a compatibility condition 
between the degree m and the dimension d. 

Definition 2.6.1. We call the pair of integers m > 2 and d > 2 "compatible" if and only 
if the following holds : For all tc G M m ^ such that there exists a sequence (0 n )n>o of dx d 
matrices with complex coefficients, satisfying det0 n = 1 and linv^oo n o n = 0, there 
also exists a sequence ip n of d x d matrices with real coefficients, satisfying det ip n = 1 
and linin^oo n o yb n = 0. 



Following Hilbert 61 , we say that a polynomial Q of degree r defined on H mi d is 
invariant if /i = is a positive integer and if for all 7r G H m d and all linear changes of 
coordinates 0, 

Q(tto0) = (det0)"Q(7r). (2.47) 



This is a generalization of (2.44). We denote by R m ,d the set of invariant polynomials on 



H mi( 2. It is easy to see that if ir £ M rri: d is such that K m) d(Ti) = 0, then Q(tt) = for all 



Q ^ Em.cz- Indeed, as seen in the proof of Proposition 2.2.1, if K mjd (n) = 0, then there 
exists a sequence (fi n such that det <f> n = 1 and tt o n — y 0. Therefore, (2.47) implies that 
Q(tt) = 0. The following lemma shows that the compatibility condition for the pair (m, d) 
is equivalent to a converse of this property. 

Lemma 2.6.2. The pair (m, d) is compatible if and only if for all tt G JH m ( i, 

Km,di^) = if and only if Q{n) = for all Q G H mj d- 

Proof: We first assume that the pair (m, d) is not compatible. Then there exists a po- 
lynomial tcq G H mirf such that there exists a sequence <p n , det <f) n = 1 of matrices with 
complex coefficients such that 7r o n — y 0, but there exists no such sequence with real 
coefficients. This last property indicates that K m ^{Ti) > 0. On the contrary, let Q G l m ,d 
be an invariant polynomial, and set il = md *! g . The identity 

Q(7r o o0) = (det0rQ(7r o ) 

is valid for all <fi with real coefficients and is a polynomial identity in the coefficients of 0. 
Therefore, it remains valid if <fi has complex coefficients. If follows that Q(ttq) = Q{'Ko° ( t ) n) 
for all n, and therefore Q(ttq) = 0, which concludes the proof in the case where the pair 
(m, d) is not compatible. 
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We now consider a compatible pair (m, d). Following Hilbert 61 , we say that a poly- 
nomial 7r G H md is a null form if and only if there exists a sequence of matrices <p n with 
complex coefficients such that det n = 1 and 7r o <fi n — y 0. We denote by NF m ^ d the set 
of such polynomials. Since the pair (m, d) is compatible, note that 7r G NF m ^ if and only 
if there exists a sequence n of matrices with rea/ coefficients such that det <f> n — 1 and 
7r o (j) n — y 0. Hence, we find that 

iVFm,d = {tt G H m>d ; ir m , d (7r) = 0}. 

Denoting by I m d the set of invariant polynomials on H m( i with complex coefficients, a 



difficult theorem of 61 states that 

= {* G H m , d ; Q(tt) = for all Q G l£ J. 



iVF„ 



It is not difficult to check that if Q — Qi + iQi where Qi and Q2 have real coefficients, 



then (2.47) holds for Q if and only if it holds for both Qi and Q2, i.e., Qi and Q2 are also 



invariant polynomials. Hence denoting by E m ^ the set of invariant polynomials on H m c ; 
with real coefficients, we have obtained that 

NF m4 = {tt G H m4 ; Q(tt) = for all Q G I m4 }, 

which concludes the proof. 



Theorem 2.6.3. If the pair (m, d) is compatible, then there exists a polynomial K on 
!Hm,d ( we se t r — degKj and a constant C > such that for all n G JH m d , 



C 



(2.48) 



Furthermore any polynomial K satisfying the above equivalence needs to be an invariant 
polynomial on H m . If the pair (m,d) is not compatible, then there does not exist such a 
polynomial K. 



Proof: The proof of the invariance of any polynomial K satisfying (2.48), and of the 
nonexistence property when the pair (m, d) is not compatible, is reported in the appendix. 
Assume that the pair (m,d) is compatible. We follow a reasoning very similar to ^2.5 to 
prove the equivalence (2.48). 

We use the notations of Lemma 12.6.21 and we consider the set 

NF m4 = {tt G M m4 ; K m>d (ir) = 0} = {tt G M m4 ; Q(tt) = 0, Q G I m , d }. 

The ring of polynomials on a field is known to be Noetherian. This implies that there exists 
a finite family Qi, ■ ■ ■ , Q s G R m ,d of invariant polynomials on M m d such that any invariant 
polynomial is of the form ^2 PiQi where are polynomials on M m d . We therefore obtain 

NF m>d = {tt G M m4 ; Q 1 (n) = ■■■ = Q s (n) = 0}, 

which is a generalization of ( 2.45| ), however with no clear bound on s. 
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We now fix such a set of polynomials, set r := 21cnii<j< s degQj, and define 



K = J]Q; lcs?Q! and K eq 



K. 



i=l 



Clearly, K is an invariant polynomial on H m d , and NF n 



{n em 



K( 



7T 



0}. 



Hence the function K eq is continuous on H m ^, obeys the invariance property K i 
I det i 



eq 



7TO( 



l^e q (vr) 



and for all n G H m , i^ eq (7r) 



implies K(7r) = and therefore K m ^(7r) 



0. We recognize here the hypotheses of Proposition 2.5.4 except that the dimension d has 



changed. Inspection of the proof of Proposition 2.5.4 shows that we use only once the fact 
that d = 2, when we refer to Proposition 2.2.4 and state that if (ir n ) G H m , 7c n — > tt and 
Kmij^n) 0, then K m (n) = 0. This property also applies to K m>dl when the pair (m,d) 



is compatible. Assume that 



.7Tr. 



G H md , 7r n — >■ 7r and that K md (iT n ) — » 0. Then there 



exists a sequence of linear changes of coordinates 
Therefore, 



det i 



KM 



lim K(7r r[ 

n— >oo 



lim K(7r ri 

n— >oo 



o </> ri 



1, such that 7r n o (p n — > 0. 



0. 



0. Since the rest of the proof of Pro- 



It follows that 7r G NF m t i, and therefore K m ^\Ti 
position 2.5.4 never uses that d = 2, this concludes the proof of equivalence (12.481) 



o 



Hence there exists a "simple" equivalent of K m ^ for all compatible pairs (m, d), while 
equivalents of K m ^ for incompatible pairs need to be more sophisticated, or at least 
different from the root of a polynomial. This theorem leaves open several questions. The 
first one is to identify the list of compatible pairs (m,d). It is easily shown that the 
pairs (m, 2), m > 2, and (2,d), d > 2 are compatible, but this does not provide any 
new results since we already derived equivalents of the shape function in these cases. 
More interestingly, we show in the next corollary that the pair (3, 3) is compatible, which 
corresponds to approximation by quadratic elements in dimension 3. There exist two 
generators S and T of 13^, whose expressions are given in [83] and which have respectively 
degree 4 and 6. 

Corollary 2.6.4. {/\S\ 3 + T 2 is equivalent to K 3 , 3 on H 3i3 . 

Proof: The invariants S and T obey the invariance properties S^tt o <ft) — (det 0) 4 5'(7r) 
and T(7r o 0) = (det 0) 6 T(7r). We intend to show that if 7r G H 3 3 and S(tt) = T(jr) = 0, 
then K 33 (ir) = 0. Let us first admit this property and see how to conclude the proof 



of this corollary. According to Lemma 2.6.2 the pair (3, 3) is compatible. The function 
.Keq := {/(^l 3 + T 2 is continuous on H33, obeys the invariance property K eq (n o <j>) = 
I det <f)\K eq (ir), and is such that i^" cq (7r) = implies ^3,3 (7r) = 0. We have seen in the proof 
of Theorem 12.6.31 that these properties imply the desired equivalence of K e „ and K- 



>-3,3- 



We now show that S(ir) = T(ir) = implies 1^33(71") = 0. A polynomial n G H 3)3 can 
be of two types. Either it is reducible, meaning that there exist ~K\ G Hi^ (linear) and 
7T2 G 112,3 (quadratic) such that 71 = 7Ti7T2, or it is irreducible. In the latter case, according 
to |60] , there exists a linear change of coordinates and two reals a, b such that 



7T o = y 2 z — (x 3 + 3axz 2 + bz 3 ). 
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A direct computation from the expressions given in [83 1 shows that S(n o 0) = a and 
T(tt o 0) = -46. If S(n) = T(tt) = 0, then S(n o 0) = T(vr o 0) = and vr o = y 2 2 - x 3 . 
Therefore for all A ^ 0, n o 0(Ax, X 2 y, X~ 3 z) = Xy 2 z — X 3 x 3 , which tends to as A — > 0. 
We easily construct from this point a sequence n , det0 n = 1, such that 7r o n — y 0. 
Therefore, ^3,3 (7r) = 0. 

If 7r is reducible, then 7r = 7Ti7T2 where 7Ti is linear and 112 is quadratic. Choosing a 
linear change of coordinates such that tti o = 2, we obtain 

7r o = 3z(ax 2 + 26a;?/ + q/ 2 ) + ;z 2 (-ua; + vy + wz) 

for some constants a, 6, c, w, v, w. Again, a direct computation from the expressions given 
in [83] shows that S(tt o 0) = — (ac — 6 2 ) 2 (and T(ir o 0) = 8(ac — 6 2 ) 3 ). Therefore, if 
S(tt) = T(tt) = then the quadratic function ax 2 + 2bxy + cy 2 of the pair of variables 
(x, y) is degenerate. Hence there exists a linear change of coordinates ip, altering only the 
variables x,y, and reals u,,u',v' such that 

7r o o -0 = u.zx 2 + z 2 {u'x + v'y + wz). 

It follows that 7r o o ip{x, A -1 ?/, Xz) tends to as A — > 0. Again, this implies that 
■^3,3(71") — 0, and concludes the proof of this proposition. 



We could not find any example of an incompatible pair (m,d), which leads us to 
formulate the conjecture that all pairs (m, d) are compatible (hence providing "simple" 
equivalents of K m ^ in full generality). Another even more difficult problem is to derive a 
polynomial K of minimal degree for all pairs (m, d) which are compatible and of interest. 

Last but not least, efficient algorithms are needed to compute metrics, from which 
effective triangulations are built that yield the optimal estimates. A possibility is to follow 



the approach proposed in 23 , i.e., solve numerically the optimization problem 



inf{det M ; M e Sj and G R d , (Mz, z) > |vr(;z)| 2/m }, 

which amounts to minimizing a degree d polynomial under an infinite set of linear constraints. 
When d > 2, this minimization problem is not quadratic, which makes it rather delicate. 



Furthermore, numerical instabilities similar to those described in Remark 2.4.7 can be 
expected to appear. 



2.7 Conclusion and Perspectives 



In this chapter, we have introduced asymptotic estimates for the finite element in- 
terpolation error measured in the L p norm when the mesh is optimally adapted to the 
interpolated function. These estimates are asymptotically sharp for functions of two va- 
riables, see Theorem 2.1.2 and precise up to a fixed multiplicative constant in higher 
dimension, as described in ^2.6 They involve a shape function K mjP (or K m ^- P if d > 2) 



which generalizes the determinant which appears in estimates for piecewise linear interpo- 
lation [4 27 . This function can be explicitly computed in several Theorem 12.4.1 
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shows, and has equivalents of a simple form in a number of other cases, see Theorems 
l2XTl and l2ir3l 

All our results are stated and proved for sufficiently smooth functions. One of our 
future objectives is to extend these results to larger classes of functions, and in particular 
to functions exhibiting discontinuities along curves. This means that we need to give 
a proper meaning to the nonlinear quantity K mp (^/) for nonsmooth functions. This 
question is addressed in Chapter 4. 

This chapter also features a constructive algorithm (similar to El), that produces trian- 
gulations obeying our sharp estimates, and is described in §2.3.2| However, this algorithm 
becomes asymptotically effective only for a highly refined triangulation. A more practical 



way to produce quasi-optimal triangulations is to adapt them to a metric, see 15, 19 66 



and Chapter 5. This approach is discussed in §2.4.2| This raises the question of genera- 
ting the appropriate metric from the (approximate) knowledge of the derivatives of the 
function to be interpolated. We addressed this question in the particular case of piecewise 
quadratic approximation in two dimensions in Theorems 2.4.4 and |2.4.6~ 



We plan to integrate this result in the PDE solver FreeFem++ in the near future, 
see Chapter 3 for a synthetic example. Note that a Mathematica source code is already 



available on the web 95 . We also would like to derive appropriate metrics for other 



settings of degree m and dimension d, although, as we pointed out in Proposition 2.4.7 
this might be a rather delicate matter. 

We finally remark that in many applications, one seeks for error estimates in the Sobo- 
lev norms W l,p (or W m,p ) rather than in the L p norms. Finding the optimal triangulation 
for such norms requires a new error analysis, see Chapter 3. For instance, in the survey [85] 
on piecewise linear approximation, it is observed that the metric h n = \d 2 f\ (evoked in 
Equation (2.40)) should be replaced with h n = (d 2 f) 2 for best adaptation in H 1 norm. In 
other words, the principal axes of the positive definite matrix h n remain the same, but its 
conditioning is squared. 



2.8 Appendix 



2.8.1 Proof of Proposition 2.4.6 



We consider a fixed polynomial n G H3, set a parameter a > 0, and look for an ellipse 
E n)0l of maximal volume included in the set or x l 2 D fl A^. Since this set is compact, a 
standard argument shows that there exists at least one such ellipse. 

If a > /in, then a~ 1/2 D C A^, and therefore a~ l/2 D nA T = a~ l/2 D. It follows that 
Eva = a~ 1/2 D, which proves part 1. 



In the following, we denote by E' a the ellipse defined by the matrix (2.41). Note that 
any ellipse containing D n and included in A^ must be tangent to dA^ at the point z n , 
and hence of the form E' s for some 5 > 0. Clearly, E' s C E'^ if and only if 8 > /i. 
Therefore, E' a C A„- if and only if a > a n . Let E be an arbitrary ellipse, D\ the largest 
disc contained in E, and D2 the smallest disc containing E. Then it is not hard to check 



that \E\ = ylDxl |D 2 |. For any a satisfying a w < a < [i^^ the ellipse E' a is such that 
D\ = D n , which is the largest centered disc contained in A„., and D 2 = or x l 2 D^ which 
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corresponds to the bound 2a x l 2 on the diameter of E^ j(X . It follows that E' a is an ellipse 
of maximal volume included in oT x l 2 D fl A,,., and this concludes the proof of part 2. 

Part 4 is trivial ; hence we concentrate on part 3 and assume that /3 n < a < a n . 

An elementary observation is that E n<a must be "blocked with respect to rotations." 
Indeed, assume for contradiction that Rg(E ir>a ) C A^ for 9 G [0, e] or [— e, 0], where we 
denote by Rg the rotation of angle 9. Observing that the set \Je^[o !e ]Re(E ntCC ) contains an 
ellipse of larger area than E Vj0l and of the same diameter, we obtain a contradiction. 

In the following, we say that an ellipse E is quadri-tangent to A„. when there are at 
least four points of tangency between dE and dA n (a tangency point being counted twice 
if the radii of curvature of dE and dA n coincide at this point). 

The fact that E^ )a is "blocked with respect to rotations" implies that it is either quadri- 
tangent to A n or tangent to dA v at the extremities of its small axis. In the latter case, 
the extremities of the small axis must clearly be the points z v and — z w , the closest points 
of dA n to the origin. It follows that E v ^ a belongs to the family E' s , 5 > a n described 
above, and therefore is equal to E' a ^ since a < a n . But E' is quadri-tangent to A„-, since 
otherwise we would have E' a ^_ £ C A„- for some e > 0. 

We have now established that E n>a is quadri-tangent to A^ when f3 n < a < a n . This 
property is invariant by under linear change of coordinates : if an ellipse E is quadri- 
tangent to A^, then <f>(E) is quadri-tangent to A n . Furthermore, if E is defined by a 
symmetric positive definite matrix H, then <f)(E) is defined by ((j)~ 1 ) T H(j)'' 1 . This remark 
leads us to the problem of identifying the family of ellipses quadri-tangent to 8A n when 
7r is among the four reference polynomials x(x 2 — 3y 2 ), x(x 2 + 3y 2 ), x 2 y, and x 3 . In the 
case of x 3 there is no quadri-tangent ellipse and we have a n = ; therefore part 3 of the 
theorem is irrelevant. In the three other cases, which respectively correspond to part 3 
(i), (ii), and (iii), the quadri-tangent ellipses are easily identified using the symmetries of 



these polynomials and the system of equations (2.37). 



The ellipses quadri-tangent to x(x + 3y ) are defined by matrices of the form H, 



x 



diag(A, where < A < 2. Note that det H\ is decreasing on (0,2s] and increasing 



on [23,2]. Given n with disc7r < 0, the optimization problem (2.39) therefore becomes 

min{det# A ; (^f H^ 1 > aid}. 

A 

If the constraint is met for A = 2 1//3 , we obtain E n>a = E n and therefore a < f3 n . Otherwise, 
using the monotonicity of A h- >■ det H\ on each side of its minimum 2 5 we see that the 
matrix H\ — aup^fa must be singular. Taking the determinant, we obtain an equation of 
degree 4 from which A can be computed, and this concludes the proof of part 3 (i). 

The ellipses quadri-tangent to x(x 2 — 3y 2 ) are defined by H xy = V T diag(A, ^)V, 
where < A < 1 and V is a rotation by 0, 60, or 120 degrees. Since det H\ V is a decreasing 
function of A on (0, 1], we can apply the same reasoning as above to polynomials ir such 
that disc7r > 0. This concludes the proof of part 3 (ii). 

Finally, the ellipses quadri-tangent to xy 2 are defined by H\ = diag(A, 2A2); A > 0. 
The determinant A h-> det H\ is a decreasing function of A with lower bound as A — > 00, 
and the same reasoning applies again, hence concluding the proof of part 3 (iii). 
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2.8.2 Proof of the nonexistence property in Theorem |2.6.3 



Let K be a polynomial on H m d which satisfies the inequalities (2.48). We observe that 
K takes nonnegative values on H m( j, and we assume in a first time that /i : = 



is an 



integer. We derive from (2.48) and from the invariance of the shape function K m d with 



(2.49) 



respect to changes of variables the inequalities 

C- 2r (det0fK(7r) < K(tto0) < C ,2r (det0fK(7r), 

for all 7r G E m)( j and all G Md (the space of d x d real matrices), where C is the constant 
appearing in inequalities (2.48 ). We regard the function Q(7r, 0) = K(7ro0) as a polynomial 
on the vector space V = H m ,<j x Md and we observe that it vanishes on the hypersurface 
Vdet '■= {(tt, 0) £ V ; det = 0}. Since (-)■ det(0) is an irreducible polynomial, as shown 
in [ll], it follows that Q(n,<f>) = (det 0)Qi(7r, 0) for some polynomial Qi on V. Injecting 
this expression in (2.49) we obtain that Qi(7r, <f] also vanishes on the hypersurface Vdet if 
\i > 1, and the argument can be repeated. By induction we eventually obtai n a po lynomial 
K on V such that K(7r o 0) = (det 0) M K(-7r, 0). It follows from inequality (2.49) that for 
all (tt, 0) G V, 

C~ 2r K(7r) < K(tt,0) < C 2r K(7r). 

This implies that K(7r, 0) does not depend on 0. Otherwise, since it is a polynomial, we 
could find 7Ti G i? m d and a sequence n G M rf such that |K(7Ti, n )| — >■ oo. Therefore, 



K(vro 



(det. 



'K(tt, 



(det 0) m K(tt, Id) = (det0fK(7r) 



This establishes the invariance property of K under the hypothesis that \x is an integer. 
Let us assume for contradiction that // is not an integer. Then applying the previous 
reasoning to K d we obtain that K(7T o 0)/K(7r) = a(7r,0)| det0| M for all 7r G H miC / and 



is an irreducible polynomial we 
■ (det 0) M K(7r), which concludes 



G Md, where a : H mid x M d — > { — 1, 1}. Since H- det . 
obtain that fi is an integer and as before that K(7r o 0) : 
the proof of the invariance property. 

Let (m, c?) be an incompatible pair. We know from Lemma 2.6.2 that there exists 
7r G H m d such that i^ TO) d(7To) > and Q(n ) = for any invariant polynomial Q G I m ,d- 



Therefore there exists no polynomial K satisfying (2.48) 
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3.1 Introduction 

We consider in this section the problem of finding the optimal mesh for the interpola- 
tion of a function by finite elements of a given degree, when the error is measured in the 
Sobolev W 1,p norm. Our purpose is to establish sharp asymptotic estimates, of the same 
type as the following in the case of the LP norm, 

limsupjV min ||/-I^/|| LP ) < C\\ yf\ det (<£*/) 1 11^, - = - + 1, (3.1) 

N^+oo V #(T)<JV ' J Tp 

which holds for a C 2 smooth function / defined on a bounded polygonal domain Q, 
see [4j|27] and Chapter 2. We refer to the introduction of this thesis and to Chapter [2] 
for a more detailed overview of existing results and of our motivations. The convergence 
estimate (3.1) is extended to arbitrary approximation order in Chapter |2j where the 



quantity governing the convergence rate for finite elements of arbitrary degree m — 1 is 
identified. This quantity, referred to as the shape function, depends nonlinearly on the 
m-th order derivative d m f. 

The purpose of the present chapter is to investigate this problem when the L p -norm 
is replaced by the W l,p semi-norm which plays a critical role in PDE analysis, and which 
is defined as follows 

ijv.*(o) := iiv/ii w(0) = (7i v ir) lP ■ 

Our second objective is to propose simple and practical ways of designing meshes which 
behave similar to the optimal one, in the sense that they satisfy the sharp error estimate 
up to a fixed multiplicative constant. 



3.1.1 Main results and layout 

We denote by P m the space of polynomials of total degree less or equal to m and by 
H m the space of homogeneous polynomials of total degree m, 

P m := Span{x k y l ; k + I < m} and H m := Span{x k y l ; k + I = m}. 

For any triangle T, we denote by I™ the local interpolation operator acting from C°(T) 
onto P m . For any continuous fonction v G C°(T), the interpolating polynomial I™ v G P m 
is defined by the conditions 

for all points 7 G T with barycentric coordinates in the set {0, • ,1}. This inter- 

polation operator is invariant by translation, hence for any polynomial tt G H m , triangle 
T and offset z we have 

\ix — I™ -1 7t\wi, P (t) = |vr — 7r|wi,p(j(-Kr). (3-2) 

If T is a triangulation of a domain Q, then I™ refers to the interpolation operator which 
coincides with I™ on each triangle T G T. 
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A key ingredient of this chapter is the shape function L mjP , which is defined by a shape 
optimization problem : for any fixed 1 < p < oo and for any n G H m , we define 

Lm,p{n) '■= inf |vr — I™" 1 ii\ w i,pm. (3.3) 
| T | =1 

Here, the infimum is taken over all triangles of area |T| = 1. From the homogeneity of ir, 
it is easily checked that 

1 / \ m — 1 I 1_ 

inf 1 7r - I™" 7r| w i,p( T ) = L mtP (ir)A 2 p. 

\T\=A 

The solution to this optimization problem thus describes the shape of the triangles of a 
given area which are best adapted to the polynomial tt in the sense of minimizing the 
interpolation error measured in W 1,p . 

The function L mtP is the natural generalisation of the function K m ^ p introduced in 
Chapter [2] for the study of optimal anisotropic triangulations in the sense of the LP inter- 
polation error 

K m , P (K) := inf \\tt - I™" 1 ir\\ L prn). 

,F\ | T | =1 

Our asymptotic error estimate for the optimal triangulation is given by the following 
theorem. 

Theorem 3.1.1. For any bounded polygonal domain Q G M 2 , any function f G C m (Q) 
and any 1 < p < oo, there exists a sequence (Tn)n>n , #(7/v) < N, of triangulations of 
Q such that 



limsupiV^I/ -I™" 1 f\ w i, m < 



d m f 
ml 



1 m — 1 1 

, where - := — 1 — . (3.4) 

L r (Q) T 2 p 



In the above convergence estimate, the number Nq is independent of / and refers to 
the minimal cardinality of a conforming triangulation of Q. The m-th derivative d m f(z) 
at each point z is identified to a homogeneous polynomial tt z G H m : 



d rn f(z) dm f r„\ xk y l 

k+l=m 



An important feature of this estimate is the "limsup". Recall that the upper limit of a 
sequence (maO n> No is defined by 

limsup Mat := lim sup u n , 

and is in general stricly smaller than the supremum sup 7V>7Vo u^. It is still an open question 

to find an appropriate upper estimate of sup Ar>A r N^^lf — I^" 1 f\w l <p(p,) when optimally 
adapted anisotropic triangulations are used. 

In order to illustrate the sharpness of (3.4), we introduce a slight restriction on se- 
quences of triangulations, following an idea in |4| : a sequence (Tn)n>n °f triangulations 
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is said to be admissible if #(7jv) < N and svp N>N (N* sup TeT diam(T)) < oo. In other 
words if 

sup diam(T) < C A N~^ (3.6) 
TeT N 

for some constant Ca > independent of N. The following theorem shows that the 



estimate (3.4) cannot be improved when we restrict our attention to admissible sequences. 



It also shows that this class is reasonably large in the sense that (3.4) is ensured to hold 
up to small perturbation. 



Theorem 3.1.2. Let Q C 

p < oo. We define \ := m =^ 
ofVt, one has 



+ 



be a bounded polygonal domain, let f e C m (Q) and 1 < 
-. For any admissible sequence (7~n)n>n of triangulations 



liminfiV^|/-I™- 1 /ki.^)> 



L 



m,p 



oTf 



ml 



Furthermore, for all e > there exists an admissible sequence {T^)n>n 
of Q such that 



(3.7) 



of triangulations 



lim sup 

JV-s>oo 



I/- Its /I 



A' 



< 



J m,p 



d m f 



ml 



(3. 



Note that the sequences (TJS 



N)N>N 



satisfy the admissibility condition (3.6) with a 



constant Ca(e) which may grow to +oo as e — > 0. The proofs of these two theorems are 
given in ^ 3.3| Theorem 3.1.1 is a direct consequence of Theorem 3.1.2[ by considering a 



sequence of triangulations of the type T^ N with en — > as — > +oo. The proof of the 



upper estimate in Theorem 3.1.2 involves the construction of an optimal mesh based on 
a patching strategy adapted from the one encountered in |4j. However, inspection of the 
proof reveals that this construction only becomes effective as the number of triangles iV 
becomes very large. Therefore it may not be useful in practical applications. 

Remark 3.1.3. It can easily be shown that if (1~n)n>n is an admissible sequence of 
triangulations and f e C m (VL), then \\f — I™ f\\LP(n) decays with the rate N~ m / 2 which is 
faster than the decay rate obtained for the W 1,p error. Therefore, our convergence estimates 
are also valid in the W 1 ^ norm 



w rl -f(n) 



V I I f IP 



1/p 



We show in ^3.2 that in order to satisfy the optimal estimate (3.4) up to a fixed multi- 



plicative constant, it suffices to build a triangulation which obeys four general principles : 

(i) The interpolation error should be evenly distributed on all triangles, 

(ii) The triangles should adopt locally a specific aspect ratio, dictated by the local 
value of d m f, 

(iii) the largest angle of the triangles should be bounded away from 7T = 3.14159 . . . 

(iv) the triangulation T should be sufficiently refined in order to adapt to the local 
features of /. 
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The third point (iii) is the main new ingredient of this chapter compared to Chapter 
[2) and is necessary for obtaining W l,p error estimates (but not for LP error estimates). 
Roughly speaking, two triangles having the same optimized aspect ratio imposed by (ii) 
may greatly differ in term of their largest angle, and the most acute triangle should be 
preferred when the interpolation error is measured in W 1 ^ rather than LP . The influence 
of large angles in mesh adaptation has already been studied in 1 10, 62 ,85], see also Chapter 
6. The heuristic guideline is that large angles should be avoided in general, since they lead 
to oscillations of the gradient of the interpolant. On the contrary, extremely thin triangles 
and very small angles can be necessary for optimal mesh adaptation. 

A practical approach for mesh generation is discussed in §3.4[ and consists in deriving 
a distorted metric from the exact or approximate data of d m f at each point x G Q. We 
restrict in this section to the case of linear and quadratic finite elements, and we provide 
simple mesh generation procedures and numerical results. To any 7r G H m , m G {2,3}, we 
associate a symmetric positive definite matrix A4 m (7r) G S 1 ^ (strictly speaking, this matrix 
is degenerated if 7r is univariate, a detail that needs to be taken care of in the numerical 
implementation). If z G £1 and d m f(z) is close to n, then the triangle T containing z 
should be isotropic in the metric A4 m (n). The requirements (i) and (ii) above, which are 
respectively linked to the size and shape of the triangles, can then be summarized through 
a global metric on Q given by 

h(z) = s(n z )M m (7r z ), n z = d -^, (3.9) 

ml 

where s(ir z ) is a scalar factor which depends on the desired accuracy of the finite element 
approximation. Once this metric has been properly identified, fast algorithms such as 
in fT9][93lf94] can be used in order to design a near-optimal mesh based on it. Recently 



it has been rigorously proved in 15 66 , that several algorithms terminate and produce 
good quality meshes, under certain conditions. Although we are not aware that the angle 
constraint (iii) is guaranteed in such algorithms, it seems to hold in practice. Computing 
the map 

7r G H m ^ M m (n) G S 2 + , 
is therefore of key use in applications (S2 refers to the set of 2 x 2 symmetric and non- 



negative matrices). This problem is solved in 66 , in the case of linear elements (m = 2) : 
the matrix Ai m (7r) is then defined as the square of the matrix associated to the quadratic 
form 7r. We give a simple expression of Ai m (7i) for piecewise quadratic finite elements (m = 
3). The optimality of this construction is proved theoretically, and numerical experiments 
confirm its adequacy. An open source implementation for the mesh generator FreeFem+-|- 



93 is provided at 95 



The shape function L mp does not always have a simple analytic expression from the 



coefficients of n. For this reason we introduce in £3.5 explicit functions it i— > h m (n) which 
are defined as the root of a polynomial in the coefficients of 7r, and are equivalent to L m}P , 
leading therefore to similar asymptotic error estimates up to multiplicative constants. 



We finally discuss in £3^3 the possible extension of our analysis to simplicial elements in 
dimension d > 2. 
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Notations 

Throughout this chapter, we define L m := L moo , where L m „ is defined at Equation 



(3.3). We prove further in Lemma 3.2.7 that for all 1 < p < oo 



cL m < L m . p < L m on H m , (3.10) 

where the constant c > depends only on m. For any compact set E C R , of non-zero 
Lebesgue measure, we denote by bary(E') its barycenter. For any pair of vectors u, v G R d , 
we denote by (u,v) their inner product, and by 



\u\ 



V (u,u), 



the euclidean norm of u. When g G L p (i?,IR d ) is a vector valued function, we denote by 
IM|jy»(.E) the LP norm of x h-> \g{x)\ on E. 

We denote by M<j(R) the set of all d x d real matrices, equiped with the norm 

\\A\\ := max \Au\. 

\u\<\ 

We denote by GL^ C Md(R) the linear group of invertible matrices and by SL^ C GL^ 
the special linear group of matrices of determinant 1. 

GL d := {A e M d (R) ; detA^ 0} and SL d := {A G M d (R) ; det A = 1}. 

For A G GLd, we denote by 

k(A) := ||A|| (3.11) 

its condition number. We denote by Sd C M<j(R) the subset of symmetric matrices, by 
Sf C Sd the subset of non-negative symmetric matrices and by S% the subset of positive 
definite symmetric matrices. For any two symmetric matrices S, S' G Sd, we write S < S' 
if and only if S' - S G S®. 

We equip the spaces P m and H m with the norm 

||7r|| : = max 1 7r (ii) I . (3-12) 

\u\<l 

Note that the greek letter ir always refers to an homogeneous polynomial tt G H m , while 
the large and bold notation 7T refers to the mathematical constant 7T = 3.14159 .... 



Recall that if g is a C m function, we identify d m g(x) to a polynomial in H m , by (3.5). 
We then denote 

||^||l-(s) := max \\d m g(z)\\ (3.13) 
with || • || the previously defined norm on H m . 



3.2 The shape function L mp and local error estimates 

In this section, we study the function L m ^ and obtain local W 1,p error estimates 
for functions of two variables. These estimates naturally give rise to a heuristic method 
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for the design of "near optimal" triangulations adapted to a function /, in other words 



triangulations satisfying the estimate (3.4) up to a fixed multiplicative constant, and it 
is put into practice in £3A_ in the case of piecewise linear and piecewise quadratic finite 
elements. The results of this section are also useful to the proof, in §3.3| of the optimal 
error estimates presented in Theorems 3.1.1 and |3.1.2| 



We first introduce the measure of sliverness S(T) of a triangle T. Given two triangles 
T,T', there are precisely 6 afline transformations \& such that \&(T) = T'. Each of these 
affine transformations defines a linear transformation ip and we set 

d(TX) ■= ln(inf{K#) ; *(T) = T"}) , (3.14) 



where K(ip) is the condition number defined in (3.11). Clearly d(T,T') > 0, d(T,T') = 
d(T, T) and d(T, T") < d(T, T') + d(T, T"). Furthermore d(T, T') = if and only if T can 
be transformed into T" through a translation, a rotation and a dilatation. Therefore d(-, •) 
defines a distance between shapes of triangles. The heuristic guideline of the papers [l0][62] 
and Chapter 6 is that obtuse shapes should be avoided when possible in the design of 
Finite Element meshes for the approximation of a function in the W 1,p norm. We therefore 
introduce the set A of all acute triangles and we define the measure of sliverness S(T) of 
a triangle T as follows 

S(T) := exp d{T, A) = inf{«(^) ; *(T) G A}. (3.15) 

This quantity reflects the distance from T to the set of acute triangles : in particular 
S(T) = 1 if and only if T G A, and S(T) > 1 otherwise. It has an analytic expression, 
which is given in the following proposition. 

Proposition 3.2.1. For any triangleT with largest angle 9, one has S(T) = max{l,tan |}. 

Proof: The result of this proposition is trivial if the triangle T is acute, and we therefore 
assume that T is obtuse. We can assume without loss of generality that the vertices of 
T are 0, au and (3v, where a, j3 > 0, u, v e 1R 2 , \u\ = \v\ = 1 and (u,v) = cos 9. Note 
that \u-v\ = 2sin(0/2) and \u + v\ = 2cos(0/2). Let * be such that \&(T) G A, and let 
ip be the associated linear transform. Since \&(T) is acute we have (ip(u),ip(v)) > and 
therefore \ip{u) — ip{y)\ < \ip{u) + ip{y)\. It follows that 

«w - m nri > x M4±|M > »l = t * 

\u + v\ \yj(u) — yj(v)\ 2cos(6y2) 2 

Therefore 5(T) > tan |. Furthermore, let ^ be defined by ip(u) = (0, 1) and ij)(v) = (1, 0). 
Obviously ip{T) has one of its angles equal to 7T/2 and is therefore acute. On the other 
hand, one easily checks that K(ip) = tan(#/2) and therefore S(T) < tan |. This concludes 
the proof of this proposition. o 

The previous proposition implies in particular that S(T) is equivalent to the quantity 
jt^q, where 9 denotes the largest angle of T, which is used in 10 62 . The following lemma 



shows that the interpolation process is stable with respect to the L°° norm of the gradient 
if the measure of sliverness S(T) is controlled. Let us mention that a slightly different 



formulation of this result was already proved in 62 , yet not exactly adapted to our 
purposes. 
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Figure 3.1 - The interpolation points on the triangle T are aligned vertically. 



Lemma 3.2.2. There exists a constant C = C(m) such that for any triangle T and any 
f E W X >°°(T) one has 



IVI 



m—1 



(3.16) 

l.oc / 



f\\L°°(T)<CS(T)\\Vf\\ L ~ {T ), 

Proof: Let T be the triangle of vertices (0,0), (1,0) and (0,1), and let g G W l ^{T 
We define g(x, y) := g(x, 0) and h(x, y) := g(x, y) — g(x, 0). Since g does not depend on y 
and si nce t he Lagrange interpolation points on To are aligned vertically, as illustrated on 
Figure 



3.1 



the Lagrange interpolant I™ 1 g does not depend on y either. Futhermore, for 





all (x,y) e T , we have \h(x,y)\ = | j v s=Q ^{x,s)ds\ < ||§J||x<»czb)- Hence 









dy 


L°°(T ) 


dy 



L°°(T ) 
— Coll Ijb ^I|l°°(To) 

< CoCi||/i||ioo(r ) 



dy 



L°°(T ) 



where the constants Cq and C\ are the L°°(To) norms of the operators g i— )■ || restricted 
to P m _i and g i-)- V^~ x g respectively. 

Let e be an edge vector of T. There exists an afline change of coordinates \l/ e , with 
linear part ift e , such that T = ^e(T) and V'e(e) = eo where eo = (0, 1) is the vertical edge 
vector of T . Noticing that 



e, V \™-\g o *„)) = (e, V((I^ 1 g) o * e )> = (e , (V I^ 1 <?) o e > 



To 



9y 



o \& 



we obtain 



(e.VIT 1 



|l°°(t) 



q Tin — 1 



< OA 



L°°(T ) 



L°°(T ) 



(3.17) 



C Ci||(e,V(po^ e ))|| LO o (T) . 
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Applying this inequality to g = f o \I/ e 1 we obtain that 

IKe^irV)!!^^) ^CoC^Vfih^, (3.18) 
for all edge vectors e 6 {a, b, c} of T. We next define a norm on M 2 as follows 

\v\ T := |a|- 1 |(a,7;)| + |6|- 1 K6,«)| + | C |- 1 Kc,7;)|. 
It follows from inequality ( 3.18[ ), that 



II | vi?- 1 /It lU-m^aCoCJiiiiv/lTlU-cT). (3.19) 

We next observe that if # denotes the maximal angle of T, 

cos(0/2)|u| < |v| T < 3|v|, 

where | • | is the euclidean norm : the upper inequality is trivial and the lower one is 
implied by the fact that at least one of the edge vectors makes an angle less than 9/2 with 



v. Combining this with (3.19), we obtain 



l|VirVll^(T)<^^y||V/|| LO o (T) . 

Since 9 > 7r/3 we have cos (gy 2 ) < 2tan(0/2) < 2S(T) according to Proposition 
concludes the proof with C = I8C0C1. o 



3.2.1 



which 



Remark 3.2.3. The following example in the simple case of piecewise linear approxima- 



tion illustrates the sharpness of inequality (3.16). Let T be a triangle having an obtuse 



angle 6 at a vertex v, and edges neighbouring v of length I and I'. Let f(z) := \z — v\ 2 . A 
simple computation shows that 

dlcLlXX 'J. ' 

||VIr/|U°°(T) = ■ a and ||V/||l°°(t) = 2max(Z,f). 
suit/ 

It follows that 

||VI^/|U- (T) = A(T)S'(T)||V/|Ucc (r)j 

with 

\m = diam ( T ) = diam ( T ) p ri 1 a 11 

1 J ' 2sin(0)5(T)max{/,Z'} 4 sin(0/2) 2 max{/, /'} 1 1 ' J ' 



which shows the sharpness of Lemma 3.2.S\ in this context. 



We now introduce for each polynomial n G H m , a special set C M 2 (R) of linear 
maps. 

A n := {A e M 2 (R) ; \Vir{z)\ < |^| m_1 for all z e R 2 }. (3.20) 

This set has a geometrical interpretation : since V7r is homogeneous of degree m — 1, we 
find that A e A n if and only if the ellipse {z e IR 2 ; < 1} is included in the set 
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{z 6 I 2 ; |V7r(z)| < 1} which is limited by the algebraic curve {|V7r(z)| = 1}. If T is a 
triangle that contains the origin and if A G A n , we observe that 

||V7r|| L oo (T) < dia,m(A(T)) m -\ (3.21) 

We define 

7 m (vr) := inf{|detA| ; A G A w }, 
so that is the maximal area of an ellipse contained in {z G IR 2 ; |V7r(;z)| < 1}. 

Remark 3.2.4. Similar concepts have been introduced in [23] for the purpose of studying 
the L p interpolation error of anisotropic finite elements, therefore with \7t(z)\ in place of 
|Vtt(z)|. 

The following result shows that a certain power of 7 m is equivalent to the shape function 
L m . For any domain ftcK" 1 and any f,g£ L P (Q) we use the shorthand 

\\(f,9)\\Lnn) := ^lj(f(z),g(z))\ p d z y = ^^ + 9^^. (3.22) 
Lemma 3.2.5. There exists a constant C = C(m) such that for all tt G M m 

C- l L m (n) < 7 m (vr)^ < CL m (n). (3.23) 



Proof: We first prove the left part of (3.23). Let ir G H m and A G An such that A is 
invertible. The matrix A admits a singular value decomposition 

A = UDV, 

where U, V are unitary and D = diag(Ai, A 2 ) with Aj > such that A 2 are the eigenvalues 
of A T A. Let T be the triangle of vertices (0, 0), (0, a/2) and (a/2, 0). We define the triangle 

T := ^/IdetA^D-^To), 

which satisfies \T\ = \Tq\ = 1 and has an angle of 7T/2 at the origin so that S(T) = 1. 
Denoting by C the constant in Lemma 3.2.2 and using (3.21), we obtain 

(l+C^HVTr-VI^TrlU-cr) < || Vtt|| L oo (t) < diam^T))™- 1 = | det A\ ^ diam{T ) m -\ 

Taking the infimum over all invertible A G A n and remarking that this set is dense in 
An, we conclude the proof of the left part of (3.23). For the right part, we define for all 
qi, g 2 G M m _i, and any triangle T, 

11(91,92)112-:= inf || q 2 ) ~ (ri, r 2 )\\ L ™(T)- (3.24) 

ri,r 2 eP m -2 

We denote by T cq an equilateral triangle centered at the origin and of area 1. Since the 
functions || • \\t c and || • || l°°(t c ) are norms on H m _ 1 x H m _! there exists a constant C t . 
such that || • ||L°°(T cq ) < C*|| • ||r eq - Let T be a triangle satisfying |T| = 1 and bary(T) = 0. 
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Then there exists a linear change of coordinates <fi G SL2 such that T = 0(T eq ). We then 
obtain 

||(gi,92)IU«(T) = 11(91° 0,92 ° 0) || L°°(T cq ) < C*||(gio0,g 2 o0)|| Te(l = C,||(gi,g2)||T 
We now choose a polynomial 7r G H. m and we set ((71,(72) := V7T. It follows from the 



previous equation and (3.24) that 

|| V7r|| L oo( T ) < C*||V7r|| r < C*||V7r — VI™ -1 ^\\l^{t) 
We define a linear map A G GL 2 associated to 7r and T = 0(T eq ) as follows 

A:= ||V7r||^ T) A-V" 1 , 

where A = 3~ 3 / 4 is the minimal distance from to <9T eq . Then for all z G dT we have 
<t>~ l {z) G <9T eq and hence |0 -1 (;z)| > A. Therefore, 

lAizT- 1 = IIWIU-cd (A- 1 |0- 1 («)|) m - 1 > ||V7r|| LO o {T) > \Vtt(z)\. 

By homogeneity, we thus find that for all z G 1R 2 

|Vvr(,)| < \A{z)\ m -\ 

which means that A G A w . Furthermore, since det0 = 1, we have 

\detA\^ = A -(«.-i)|| V7r ||_ I . ioo(r3 < ^ A -(--i)|| V7r _ vI^ttIUoc^). 

Hence taking the infimum over all triangles T satisfying |T| = 1 and bary(T) = we 
obtain 

7m(7r)^<C inf || Vvr - V I^T 1 7r|| Loo(T) (3.25) 

bary(T)=0 

with C = C*\~( rn ~ 1 \ Using the invariance of the interpolation error under translation, as 



expressed by (3.2), we find that the right hand side of (3.25) is CL m (ir), which concludes 



the proof. o 

We next introduce a measure of the isotropy of a triangle T with respect to the 
euclidean metric : 

P(T) := d '^p. (3.26) 

If T is an obtuse triangle, an elementary computation shows that 4S(T) < p(T). Indeed, 
if the largest angle of T is 9 > 7T/2, and if the edges neighbouring the angle 9 have length 
li, I2, we obtain using l\ + l\ > 2i x i 2 that 
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Since the minimal value of p is 4/ a/3 (for the equilateral triangle), and since S(T) = 1 for 
acute triangles, we obtain that for any triangle T 

p(T) > ^=S(T). (3.27) 

The functions S and p have a different behavior : p(T) increases as T becomes thinner, 
while S(T) increases only if an angle of T approaches 7T. 

In the follow up of this chapter, we frequently distort the measure of isotropy p by 
a linear transform. If A G GL2, then p(A(T)) reflects the isotropy of T measured in the 
metric A T A. In particular p(A(T)) is minimal, i.e. equal to 4/Vo, if and only if the ellipse 
£ containing T and of minimal area is of the form 

£ = {z G M 2 ; \A(z - bary(T))| < r} 

for some r > 0. 

We may now state the main theorem of this section. 

Theorem 3.2.6. There exists a constant C = C(m) such that for all it G M m , all A G A n 

and any triangle T , we have 

br-I^TiV^T) < C\T\$ S(T) p(A(T))V|detA|V (3.28) 
where i := + K Furthermore for any triangle T and any g G C m (T), we have 

\g-V?- l g\w^(T) < C\T\$ S(T) p(T)^\\d m g\\ L ^ T) , (3.29) 



where \\d m g\\L°°(T) is defined by (3.13). 

Before proving this result, we make some observations on its consequences. Combining 
the two estimates contained in this theorem, we obtain a mixed anisotropic-isotropic 
estimate, that can be used as a guideline for producing triangulations adapted to a function 
/ G C m (Sl). Let T be a triangle, let / G C m (T), n G H m and A G A n . Then 

|/ - I™" 1 f\ W l,P(T) 

< |7r-I™- 1 7r| w/ i, P(T) + |(/-7r)-I™- 1 (/-7r)| w/ i, P(T) , (3.30) 

< C\T\7S(T) (p(A(T)) V I det A\ ^ +p(T)^\\d m f - d m 7r\\ Lac{T) ^ , 

where C = C(m). Note that the left term in the parenthesis is an "anisotropic" contribution 
to the error, while the right term is an "isotropic" contribution. 

Let e > and 1 < p < 00. We now explain how the requirements (i), (ii), (iii) and (iv) 
heuristically exposed in the introduction can be mathematically stated, and show that 
the estimate 

#(Tf^\f ~ I T~ 1 /lm*(n) < C \\L m (7r z ) + s\\ LT{n) , 
is met when the triangulation satisfies these requirements. Consider a polygonal and 
bounded domain Q, a function / G C m (Q) and a triangulation T. For each z G Q, we 
denote by T z G T the triangle containing z and define tt z = d G H m . The adaptation 
of T with respect to / for the W 1,p semi-norm, can be measured by the smallest constant 
C7- > 1 such that the following criterions are met : 
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(i) (Equilibrated errors) There exists a constant 5 > such that for all zGfl, 

C T l 5 < \T z \^(L m (ir z ) + e) < C T 5. (3.31) 

(ii) (Optimized shapes) For all z & Q, there exists A z e A^, such that 

p{A z {T z )) < C T and | det < C L {L m {n z ) + e), (3.32) 

where Cl is the constant that appears in Lemma ( 3.2.5| ). According to this lemma, 
such an A z always exists for any e > 0. 

(hi) (Bounded sliverness in average) The averaged P(T) norm of S is bounded as follows 



l — S(Ty) P < Or. 



#(T) 



(3.33) 



TeT 



This condition is less stringent than asking that S(T) < Cj- for all T e T, and turns 
out to be sufficient for proving the optimal error estimate. 

(iv) (Sufficient refinement) The mesh T is sufficiently fine in such way that the local 
interpolation error estimate(3.30) is controlled by the "anisotropic" component. More 
precisely, for all z 6 Q, 



p{T z )^\\d m f - d m f(z)\\ L ~ (Tz) < C r (L m (7r z ) + e). 



(3.34) 



This condition is ensured by sufficient refinement of the triangulation due to the 
following observation : If T' z is the image of T z by a homothetic size reduction around 
z, then p(T' z ) = p(T z ) while \\d m f — d m f (z)\\ (t^) tends to zero due to the continuity 
of d m f. 

We now produce a global error estimate from these four assumptions. For a given 
z G f2 we inject successively n = n z , (3.32), (3.34) and (3.31) into the estimate (3.30) and 
obtain 



1/ - I™* 1 f\w x 'P(T z ) 
< C\T Z \^S(T Z ) (p(A z (T z ))^\ det A z 



777 — 1 

~1T~ 



+ P(T Z 



m — 1 



d m f-d m 7T z \\ L , 



(T) 



< C\T z \rS(T z )(C r > C L (L m (^)+e)+ P (T z ) s ^\\d m f-d m f{z)\\ L oo iT) ) 

1 m — 1 

< C\T z \rS(T z )(C T * C L + C r )(L m (7r z )+s) 

< C 5S(T Z ) 



where Co = Co(m,CV, Cl). Using (3.33) we obtain 
\f f\wi,»(n) = E \f f\ P w^(T) < O p E^( T ) P ^ (CoCr) p 5 p #(T). (3.35) 



TeT 



TeT 



On the other hand, the left side of inequality (3.31) provides an upper estimate of 5 as 
follows. 

dz 



Jn \ 1 z\ Jn 



< / (L m (n z ) +e) T dz 



\L m (ir z ) 



(3.36) 



126 



Chapter 3. Sharp asymptotics of the W ' p interpolation error 



Combining (3.35) with (3.36) we eliminate the variable 5 and obtain 



#(T)^i/-irvi 



Ivy 1 * (si) 



<C\\L r 



7T, 



L T (fl) 



(3.37) 



where C = C(m,Cf). Hence the optimal asymptotic estimate (3.4) is satisfied up to 
a multiplicative constant depending only on the degree m — 1 of interpolation and the 



quality of the mesh reflected by Cf. Note however that the properties (3.31), (3.32), (3.33) 



and (3.34) required for Gf may lead to a very pessimistic constant C = C(m, Cf, Cl) in 
inequality (3.37). Finer estimates and weaker conditions on the mesh T can be obtained 



from (3.30). 



In the context of the H l = W 1 ' 2 semi-norm and of piecewise linear and quadratic 
elements we present numerical results in §3.4. 2| and discuss the quality of a numerical 
mesh T using three quantities cr(T), p(T) and S(T) that are related to the conditions 
(i), (ii) and (iii) respectively. We also discuss in £3.4 a reformulation of the requirements 



of size ( 3.31[ ) and shape (3.32) for the triangles T of the mesh T in terms of Riemannian 
metrics, a more convenient form for mesh generation. 



The construction of a mesh which satisfies both the requirements (3.32) of optimized 



shapes and (3.33) of bounded measure of sliverness is a difficult problem. The construction 



presented in this chapter, for the proof of Theorems 3.1.1 and 3.1.2, is based on a local 
patching strategy. A small portion of the triangulations, which can be neglected as the 
cardinality tends to infinity, does not satisfy conditions. Another approach to this mesh 



generation problem is presented in Chapter 5, where the requirements (3.33) is replaced 



as follows : the measure of sliverness needs to be uniformly bounded on a refinement of T. 
The role of the measure of sliverness in the W l,p approximation error is again discussed 
in Chapter 6. 



Proof of Theorem 

we obtain 



3.2.6 



h-l 



m—l 



h\ 



Let T be a triangle and let h G C 1 (T). Using lemma 3.2.2 

w^p(T) — 



\\Vh 

< 

< \T\p(l 



VI 



m—l 



h\\jj>(T) 

h\\L°°(T) 

CS(T))\\Vh\\ L ~ {T ). 



T\p\\Vh- vr^r 1 



(3.38) 



Replacing h with tt in inequality (3.38) and combining it with (3.21), we obtain that if T 



contains the origin, then for all A E A n 

7T\w^'P(T) 



n - I™" 1 7t\ w i H t) < \T\ 1/P {1 + CS{T)) diam(A(r)) 



m—l 



The left and right quantities in the above inequality are invariant by translation of T and 
therefore this inequality remains valid for any T. Combining it with the identity 

diam(A(T)) 2 = \T\ \ det A\ p(A(T)), 



this leads to the first inequality (3.28) of Theorem 3.2.6[ For the second inequality, we 
take g G C m {T) and z 
development of g at zq, 



h(x,y) := g(x,y) - 



xo,yo) G T. We now take for h the remainder of the Taylor 

Qk+l, 



dx k dy 



9 , s (x - x ) k (y - y ) 1 
{ z o)- 



/! 
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Therefore h G C m {T) and 



h(z Q ) = dh(z ) 



d m - x h{z Q ) = 0. 



(3.39) 



It follows that 



VA||l«(t) < ^(diamT)™- 1 !^" 1 ^!!^^) = d|T| — p(T)~ ||d m /i|U~ ( T), (3.40) 



where Ci = C\{m). Combining (3.38) and (3.40), we obtain 

>r ^I^-(T) < C\T\^S(T)p(T)^\\d m h\\ L ^ Th 



\h-vr 1 



(3-41) 



where C = C(m). We now observe that g — hE P m -i, hence <i m g = c? m /i and h — I™ -1 h = 
g — I™ -1 g. Injecting this into the last equation we conclude the proof of inequality (3.29) 
and of Theorem 13.2.61 o 



As a conclusion to this section we prove inequality (3.10), which links the functions L m>p 



and L 



III I • III. x • 



Lemma 3.2.7. There exists a constant c = c(m) > such that for all 1 < p\ < P2 < oo, 



cL m T m y^ ^ L m p 2 ^ L m on JH rni 



(3.42) 



Proof: Let T eq be an equilateral triangle of area 1. Since all norms are equivalent on the 
finite dimensional space P^-n there exists a constant c = c(m) > such that for all 
(91,92) e IPm-i x Pm-l, 



c||(gi,g 2 )||L-(T cq ) < ||(gi,g 2 )||Li(r cq ), 



Furthermore, since T eq has area 1, we have 



l,«?2)|Upi(T eq ) < l|(9l,92)|Up2(T eq ), 



(3.43) 



(3.44) 



for all 1 < pi < P2 < 00. If T is a triangle satisfying |T| = 1, there exists an affine change 
of coordinates \I/ such that T = ^(T eq ) and we have ||(gi, ?2)||i,p(t) — 11(^1°^, ?2°^ f )||LP(r cq ) 
for all (gi, g 2 ) G P m -i x P m _i and 1 < p < 00. Combining this invariance property with 



inequalities (3.43) and (3.44) we obtain 



c||(gi,?2)IU«(T) < ||(?i,g 2 )IUw(T) < H(?i,g 2 )IUp2(T) < ||(?i,? 2 )IU«(t)- 



(3.45) 



We now choose a polynomial 7r G H m , we set (gi, g 2 ) := V7r — VI™ 7r, and we take the 



infimum of (3.45) among all triangles T of area 1. This leads to the announced inequality 



(3.42) which concludes the proof. 



o 
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3.3 Proof of Theorems 13.1.11 and I37L2 



The polygonal domain Q, the integer m, the function / 6 C m (Q) and the exponent 
1 < p < oo are fixed in this section which is devoted to the proof of the lower estimate 



(3.7) and the upper estimates (3.4) and (3.8) which are stated in Theorems 3.1.1 and 



EX2 

We denote by p, zo the Taylor polynomial of / and of degree m at the point z 
0o,2/o) e 

^ d k+l f{z ) (x - x ) fc (y - yo) 1 
^z [ x i y) ■— / J 



dx k dy l 



k\ 



l\ 



k+l<m 

Note that tt z is the homogeneous component of degree m in /i z . Therefore d m 7i z = d m fi z 
d m f(z) for any z£fl, and for any triangle T 



_ -rm— 1 _ , , , 



m—1 



(3.46) 



3.3.1 Proof of the lower estimate (3.7) 



The following lemma allows to bound by below the interpolation error of / on a triangle 



T. 



Lemma 3.3.1. Let \ := ^ + ±. For any triangle T dVt and z £ T we have 

1/ -It' 1 f\w^(T) > (W^) -^(diamT)/,^) 2 ^^^) 

where the function u> is positive, depends only on f and m, and satisfies oj{5) — > as 
5^0. 



Proof: Let h := / — Using Equation (3.46) we obtain 



\f — I™ /Iw^t) > 1^ — It KzIw^Ct) — |^ — I™ ft|iyi>p(T) 



> |ThL miP (7T 2 ) — I ft, — Iy 1 ft|iyl,P(T)- 



and we have seen in Theorem 13.2.61 that 

\h-1%r 1 h\ w i,p m < Co\T\$S(T)p(T)^\\d! n h\\ L o 0( i< ) . 
for some constant Cq > depending only on m. We then remark that 

\\d m h\\ L oo {T) = \\d m f - cT^IU-crj = IK/ - rf m /W||^(T). 
Therefore, defining 

u;(5):=C sup ||d-/(z)-d m /(zOII, 

2,z'£!! ; \z-z'\<8 

we conclude the proof of this lemma. 
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We now consider an admissible sequence of triangulations (Tn)n>n - For all N > N , 
T G Tn and z G T, we define 4>n{z) := |T| and 



ip N (z) := \L m!P (ir z ) - w(diam(T))p(T) 2 5(T) 
where A + := max{A,0}. Holder's inequality gives, with - := ^ 



m— 1 1 1 



(m — ljp \ p 

2 V? 



y /V 



• TV 



y /V 



(m— 1)t 
2" 



(3.47) 



Note that J n (j)^ = #Tn < iV. Furthermore if T G 7jv and z G T then according to 
Lemma 13.3.11 



hence 



(m — l)p 



M/i,p(T)' 



(«) 



Inequality (3.47) therefore leads to 



m\\mn) <\f- I?" 1 f\w^(n)N^. (3.48) 

Since the sequence (7n)n>n is admissible, there exists a constant C A > such that for 
all N and all T G 7jv we have diam(T) < C A N~2. We introduce a subset of C 7jv 
which gathers the most degenerate triangles 

% = {TeT N ; p{T) > uj(C a N-^)^}, 



m — 1 
2 



7at. For all z G \ fi'jy, recalling from (3.27) that p > S, we obtain 



where u; is the function from Lemma 3.3.1 We denote by Cl' N the portion of f2 covered by 



i[) N (z) > L miP (ir z ) - \Juj(C a N 2). 



Hence 



> 



> 



L m>p {ir z ) - \Ju{C A N a) 



C T \Q 



where C := max 2g ^ L m , p (ii z ). We next observe that |f^y| — >• as N — > +00 : indeed for 
all T G T' N we have 



|T| = diam(T) 2 p(T) _1 < C 2 A N~ l u(C A N- 



2 ) m + 1 



Since #TJ(r < N, we obtain (fi^l < C 2 A u{C A N which tends to as iV ->■ 00. We 

thus obtain 



liminf ||-0Jv|U^(n) > lim 



L T (Q) 



Combining this result with (3.48) we conclude the proof of the announced estimate (3.7). 
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3.3.2 Proof of the upper estimates (3.4) and (3.8) 



The proof of these upper estimates is based on an explicit construction of the trian- 
gulations Tn, which is adapted from the construction in [4|. Roughly speaking, the idea 
of this construction is to produce a first mesh 71 of the domain Q, composed of elements 
sufficiently small so that / can be regarded as a polynomial tir on each triangle R £ 1Z. 
Each element R e TZ is then tiled with small triangles optimally adapted to 7Tr, and some 
technical manipulations are done in order to preserve the conformity at the interfaces of 
the elements of TZ. The main difference with the construction first proposed in j4j, and 
used later in Chapter [2j is that the measure of sliverness S of the generated triangles 
should be kept under control. 

Let T be a triangle with vertices (zq, zi, z 2 )- We define the symmetrized triangle T of 
vertices (zi, z 2 , z\ + z 2 — Zq) so that T U T is a parallelogram. We define a tiling Vt of the 
plane 1R 2 as follows 

V T := {a(*i - zo) + p(z 2 -z )+T' ; a,p e 71, T e {T, T}}. (3.49) 

A homogeneous polynomial it e E m is either even or odd (depending on the parity 
of m). Combining this observation with the translation invariance ( |3.2 ) we obtain that 



7r — I™, tt\w 1 >p(t') is constant among all triangles V G Vt- We also define 

V T , n ■= -V T (3.50) 

n 

the tiling obtained by rescaling Vt by a factor -. We use this rescaled tiling in order 
to subdivide an arbitrary triangle R, up to a few additional triangles located near the 
boundary of R, as expressed by the following lemma. 

Lemma 3.3.2. Let R and T be two triangles. There exists a family {VT, n {R))n>o> of 
conforming triangulations of R such that the following holds 



1. Nearly all the elements ofVr,n{R) belong to Vt,u, which is defined by (3.50), in the 
sense that 



,. mThJM \R\ . #<Tln(R)) n 

hm = — — and hm = 0. (3.51 

n 2 T n~>oo n 2 



where 

V T>n (R) ■= V T ,n{R) n V T ,n and V T , n (R) ■= V T ,n(R) \ ^T,n (3.52) 

2. The vertices of Vt,u(R) ° n the boundary of R are exactly those of the form + 
(1 — -)b, where < k < n and a, b are vertices of R. 

3. There exists constants C\ = C\(R,T) and C 2 = C 2 {R,T) such that 

sup In max diam(T') ) < C\ and sup max S(T') < C 2 . (3.53) 

n > V T'&VtAR) ' n>0 T' G P T ,n(R) 
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Proof: See appendix. 

For any M > 0, we define the compact set of triangles 

T M := {T ; \T\ = 1, diam(T) < M and bary(T) = 0}. 
Note that for all T G Tm, 

p(T) < M 2 . 

We also define the function 

Lm{k) '■= min k - It" 1 Hw^t)- (3.54) 

Since T M is compact for the Hausdorff distance between sets and since T i— > \ir — 
I™ -1 7t| H /i, P ( T ) is continuous with respect to this distance on the set of all triangles, we 
find that this minimum is indeed attained and that Lm is continuous. We also observe 
that M i — y Lm(tt) is a decreasing function of M and that 



lim L M (n) = inf | vr — I™ 1 tt\w^p(t) = inf |vr — I™ 1 7r|wi.p(r) = £ 



M->oo |T|=l,bary(T)=0 |T|=1 



where we have used the invariance under translation of the interpolation error (3.2) for 
the second equality. 

The constant M > is now fixed until the last step of this proof. Let tt G H m and 
let T C be homothetic to a triangle achieving the minimum in the definition of Lm(7t). 
Then, 

1/ - Ir _1 f\w l >p{T) < k - I™ -1 Aw^p(T) + \(f ~ tt) - Ir _1 (/ - ^)\w^p{t) 

< \T\^L M (n) + C\T\$p(T) ^S(T)\\d m f - d m 7r\\ L <~ (T) (3.55) 



< \T\r(L M (n) + CM m+1 \\d m f -d m ir\\ La 



•(T)> 



where we have used inequality (3.29) in the second line, and in third line the fact that 

p(T)^S(T) < p{T)^ < M m+ \ 

since S < p and T is homothetic to an element of Tm- 

Let S > which value will be specified later. Since d m f is continuous, we can choose 
a sufficiently fine mesh 1Z = 1Z(M, 5) of Q in such way that, 

CM m+1 \\d m f{x) - d m f(y)\\ L oo {T) < 5, for all R G 11 and x,y G R. (3.56) 

For any triangle R G TZ we define 

z R := argmmL M (^ z ) and tt r := tt Zr . (3.57) 

zeR 

We also define 

T R :=(L M (n R )+S)-*T m , (3.58) 
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where T* G Tm achieves the minimum in the definition of Lm{t^r)- We denote by V n (R) 



Vt r ,ti(R) the triangulation of Lemma 3.3.2 built from the two triangles R and Tr, and 
similarly V*(R) = V^ Rn (R) and V%(R) = V^ RTl (R). We define for all n the global mesh 
of it 

T n M ' 5 = |J V n (R), 
Ren 

which coincides with V n (R) on each R G 1Z. Since all the meshes V n (R) are conforming, 
and since V n (R) has by construction n + 1 equispaced vertices on each edge of R, the 



mesh Tn is also conforming. According to Equations (3.51) and (3.57), we have 



= ^|i?|(L M (7r fi ) + 5r 



< 



(L m (tt,) + 5) T dz. 



(3.59) 



For T G Vn(R), we combine (3.55), (3.56) and (3.58) to obtain 

1/ - It" 1 /k^(T) < n-r for all T G V X n {R). 



For T G Vn(R), we invoke the isotropic estimate (3.29) to obtain 



(3.60) 



If-ir/kMT) < C|T|?5(T)diam(T) TO - l^/H^^ 
< C75 , (T)diam(T)"||d m /|| L <» ( „ ) 



(3.61) 



where C is the constant from ( 3.29| ). Using the third item in Lemma 3.3.2, we find that 
that there exists constants G\ = Ci(M, 5) and C 2 = C 2 (M, S) such that 

sup I n max diam(T) I < C\ and sup I max S(T) I < C 2 , (3.62) 

n>0 V TeT„ AM / n>0 \TeT^ S J 



so that, combining with (3.61), we have for all T G V^(R) 



\f — It 1 /Iv^i.p(t) — 



(3.63) 



with Co = C*o(M, 5). Combining (3.60) and (3.63), and using the first item in Lemma 



3.3.2 , we obtain 



1/ V^M,S f\whP(Ct) 



(T) 



TeT r , 
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therefore 



limsupn 2 - 2 \f -T M i f\ P w i, Hn) < Y] lim 



#{Vl{R))+#{Vl{R))Cl 



Ren 



j2\R\(L M M+sr 



Ren 



< 



{L M (n z ) + S) T dz. 



Combining this with ( |3.59[ ) we obtain 



lim sup #(T n 



M,S\ 



r jm-1 r 

J J-^-M.a J 

In 



< \\L M (n z ) + 5\\ LT 



(Q) 



(3.64) 



Let e > 0. Since 



lim \im\\L M (7r z ) + 5\\ L r {n) = lim \\L M (ir z )\\ L T {n) 

RI— >oo 5— s>0 M—toc 



\Lm,p(^z)\\L T (n) 



we can choose adequately M and S in such way that ||-^A/(^)+^IU T (n) < \\Lm, P (ji 'z)\\l t (n) + 
e. 

Let n = n(N, M, 5) be the largest integer such that #(7^ M,(5 ) < N, we define 

q-e _ q~M,5 

'N ■— >n 



so that iV 1 Jf(7 N ) — > 1 as N — > oo. If follows from (3.59) and (3.62) that the sequence of 
triangulations (T N ) is admissible, and inequality (3.64) gives 



limsupiV 2 \f -Ire f\w^p(n) < \\L miP (7r z )\\ L rr n) +e. 

N^oo N 



which is the upper estimate (3.8) announced. Last we choose for all N large enough 
e(N) > such that 



N^lf-l^Jlw^ < \\L m ^ z )\\ L r {Q) + 2e(N). 
1 1< 



and such that e(N) — > as N — > oo. The sequence of triangulations 7jv : = fulfills 



the estimate (3.4) which concludes the proof. 



3.4 Optimal metrics for linear and quadratic elements 



The proof of the upper estimate (3.8) exposed in the previous section involves the 



construction of meshes by tiling each element R of the "coarse" triangulation 1Z using 
the finer mesh V n (R). In practice, such a construction may require a very large number of 
triangles in order to match the optimal error estimate. More commonly used strategies for 
mesh generation are based on the prescription of a non-euclidean metric depending on / 
for which each triangle should be isotropic. In this section, we explain how to design such 
metric in order to derive near-optimal error estimates and we give analytic expressions in 
the particular case of Pi and P2 finite elements. 
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3.4.1 Optimal metrics 

As a first step, we express the requirements (i), (ii) and (iv) of mesh adaptation in terms 
of metrics. We therefore use the following notations : we consider a polygonal domain Q, 
an integer m > 2, an exponent 1 < p < oo, and a function / G C m (Q) to be approximated 
in the W l,p semi-norm by P m -i finite element interpolation on a triangulation of Q. We 
also consider two real numbers e > and 5 > 0. 

We define for all 7r G H m 

A'^ := {M G 5+ ; |Vtt(z)| 2 < (z^z)" 1 " 1 for all z G 1R 2 } = {A T A ; A G A}, (3.65) 

and we consider a continuous field M. : — > S£ of symmetric positive definite matrices 
satisfying JA(z) G A'^ for all 2 G f2 and 



C^(X m (7r a ) +e) < (detMiz))^ < C 2 (L m (n z ) + e), 



(3.66) 



where C2 is an absolute constant. The existence of such a field is established in full 
generality in Chapter 6. We explain in the sequel of this section a practical construction 
in the case of piecewise linear and bilinear elements. We then define a field of symmetric 
positive definite matrices h on Q by 



h(z) := 6- T (detM(z))^M(z) 



(3.67) 



where - : = 



m-l 1 1 



\-K Such a field h is called a Riemannian metric. Under some assumptions 



on the metric h and on the domain fl, which are discussed in 15,66 and Chapter 5 for 
the infinite domain ¥L d , it is possible to produce a triangulation T of Q satisfying for all 
T G T and z G T 



Cy 1 < \T\y/deth{z) < d and p (^(z)(T)) < C x 



(3.68) 



where the constant C% > 1 reflects the quality of the adaptation of the mesh to the metric 
h. (In the second inequality the square root is meant in the sense of symmetric positive 



matrices). Examples of such mesh generators are 19,93,94 . We shall not discuss in this 
chapter the conditions under which such a mesh can be generated. Let us only mention 
that, if one ignores a few outliers at the corners of Q, these conditions hold if 5 is small 
enough. 

Note that (deth(z))^ = <5 -1 (det M. (z))~^~ , therefore if (3.68) holds we find that for 
all T G T and z6T, 



{C X C 2 )- X 8 < \T\^{L m {-K z )+e) < C X C 2 5, 



hence condition (i) of equilibrated errors, as stated in (3.31), holds provided CV > C1C2. 
Furthermore for all z G Vt let us define A z := ^ 'Ai(z) and note that A z G A Wz and 
det/1^ = ^/detMjz). Using (3.66) and (3.68) we find that condition (ii) of optimal 



shapes, as stated in (3.32), holds provided Cf > C^- Condition (iv) holds when the mesh 
T is sufficiently refined, which is the case if 5 is small enough. 
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In summary, given a map M. : Q — > satisfying JA(z) G A' nz , (3.66) and such that 



M. (z) is positive definite, state of the art mesh generators allow us to build triangulations 
T that match the conditions (i), (ii) and (iv). In order to prove the near-optimal estimate 



#(T)^|/-I 



m— 1 

r 



\w l >p(a.) 



< C\\L r 



7T 2 



it is also necessary that the generated meshes satisfy condition (iii) of bounded measure 



of sliverness, as stated in (3.33). Unfortunately, the author has not heard of theoretical 



results that would guarantee this condition when a mesh is built by such algorithms, apart 
from Theorems |5. 1.14 and 6.1.2 which only apply to the infinite domain IR 2 or the periodic 
domain (IR/2Z) 2 respectively. We discuss in £3.4.2 the observed behaviour of S(T) when 
using the mesh generation software 93 . 

which correspond to JPi and P 2 elements, we give in the sequel a 



For m G {2, 3} 
simple expression of a continuous map M. 
7r G H m and 



: H m -> Sf satisfying M m {^) G A' w for all 



K- x L m {Ti) < (det M m (Tr))— < KL r 



7T 



(3.69) 



for some absolute constant K > 1. It is not hard to build from A4 m (iT z ) a matrix Ai(z) 



satisfying (3.66). For practical uses one usually takes 



M(z) :=7W m (7r 2 )+ y uId. 

where the constant /i > is here to avoid degeneracy problems. 

Let us mention that there exists radically different approaches to anisotropic mesh 
generation, which are not based on Riemannian metrics. For example the hierarchical 
refinement procedure exposed in Chapter [7j which is proved in Chapter 8 to yield the 
best possible estimate (3.1) in the case of piecewise linear interpolation of bidimensional 



convex functions with the error measured in L p norm. This approach does not seem to 
adapt well to the W l,p norm : the main problem arises again from condition (iii) of bounded 
measure of sliverness, and from the lack of conformity of the triangulations generated by 
this procedure. 



3.4.2 The case of linear and quadratic elements 



We now give analytic expression of matrix fields M.2 and satisfying (3.69), which 



correspond to linear and quadratic elements. In the simplest and already well established 



case of Pi elements, a more detailed analysis can be found in 85 . In contrast, the results 
for quadratic elements are new. 



For any homogeneous quadratic polynomial ir G H 2 , 7r = ax 2 + 2bxy + cy 2 , we define 
the symmetric matrix 

a b 
b c 



71 



We define 



M 2 (ir) :=4[tt] 2 = 4 



a 2 + b 2 ab + be 
ab + bc b 2 + c 2 



(3.70) 
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For all z G IR 2 one has Vn(z) = 2[ir]z, and therefore |V7r(z)| 2 
that M 2 {n) G A' w and 

det MM = inf{det M ; M G A' n } 



Z T M.2Mz. It follows 



which implies (3.69) according to Lemma 3.2.5 



For any ix G H 3 we define 



M 3 (n) := sj\d^ + [d y *]* 
If 7r = ax 3 + 3bx 2 y + 3cxy 2 + dy 3 then in terms of the coefficients of 7r 

MM = 



(3-71) 




a 2 + 2b 2 + c 2 a6 + 26c + cd 
ab + 2bc + cd b 2 + 2c 2 + d 2 



In the sense of symmetric matrices, we have 

MM = yj[d**\ 2 + [dy*] 2 > VW^ = \[d*n]\, 
where we used the fact that the square root y/^S® — > S® is increasing. It follows that 

|Vtt(z)| 2 = \d K *(*)\ 2 + IVWI a < ^M-Mz) 2 , 
hence V2M(ir) G A'(n). Note that 

det MM = VV( a2 + 2 & 2 + c 2 )(b 2 + 2c 2 + cP) - (ab + 2bc + cd) 2 . 



(3.72) 



It remains to establish (3.69). This point is postponed to § |3.5| right after ( 3.85[ ), as we 
develop a general method for obtaning simple equivalents of the functions L m . Let us 
finally mention the work [65] in which approximate solutions to the optimization problem 
inf {det M; Me A n } are obtained through numerical optimization. This approach works 
for general m but is harder to use than the algebraic expressions of A^v) and Mzijt) 
given here. 

3.4.3 Limiting the anisotropy in mesh adaptation 

The measure of non degeneracy of a triangle and of its image by a linear transform 
can be linked by the following result. 

Proposition 3.4.1. There exists an absolute constant c > such that for any triangle T 
and any A G GL 2 , 

- P(T) 



P(A(T)) 



< 



\A\\\\A 



(3.73) 
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Proof: We use in this proof the identity |det5| = x || 1 which holds for all 

B G GL 2 . Let T' be a triangle and let A' G GL 2 , then 

M(/mM diamM'fT')) 2 \\A'\\ 2 diam(T') 2 ln 
FV v ;; A'(T') ~ detA' \T'\ HK ' 



with the particular choice A 1 = A' 1 and T' = A(T) we obtain the right side of (3.73). 
Let T eq be an equilateral triangle of area 1, and let p be the diameter of the largest ball 
included in T eq . Up to a translation on T eq we can assume that there exists B G GL 2 such 
that T = B(T eq ). We then have 

diam(T)diam(A(T)) = diam( J B(T cq )) diam(AB(T eq )) 

> fi 2 \\B\\\\AB\\ 

> ^ii^iipiiiie- 1 !!- 1 

= p 2 \\A\\\detB\. 

Hence, since |T| = |£>(T eq )| = |detS|, 

m (A(T ss diam(T) 2 diam(A(T)) 2 (p 2 \\A\\ | det B\) 2 
P(T)P(A(T)) = \f\lMTj\ " \detB\ 2 \detA\ = HWIP " 



which establishes the left part of (3.73) with c = p 4 = 2 t S . o 

A consequence of the above lemma is that if T is a mesh adapted to a metric h in the 
sense of (3.68), then for all T G T and z G T we have 



cCrVUM^IIIIM^)- 1 !! < pCO < c^\\h{z)\\\\h-\z)\\. 

The measure of non- degeneracy p(T) is thus large when h(z) is ill conditioned. Although 
this property is desirable in order to adapt to highly anisotropic features of the function / 
to be approximated, excessive degeneracy can cause mesh generation problems, which are 
discussed in £ 3.4.2 In the following, we explain how to slightly modify the construction 
of A4 2 and A4s in order to control the value of p(T). 

According to ( ]3.67[ ) we have H/i^llll/i -1 ^)!! = ||A^(^)|| ||A^(^) _1 ||, and thus 

p(T)<C lV /\\M(z)\\\\M(z)-i\\. 

This leads us to define for all a > 1, 

A'^ := {M G A' n ; IIMIHIM- 1 !! < a 2 }. 

Let M G S2, let R be a rotation and let A > p > be the eigenvalues of M in such 
way that M = R T diag(A, p)R. We define for any a > 1 

M^:=R T ( X n ° Ar. (3.74) 

V max(Aa , p) J 
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Clearly M^> > M, and if M ^ then ||M( Q )|||| (AfW) 1 || < a 2 . Hence for all M G A'„ 
we have G A^ a . 

In the case of piecewise linear elements we therefore have Af 2 (f) G f° r a ^ 

7r G H 2 , and one easily shows that det = infjdetM ; M G A' na }. This suggests 

that constructing Ai(z) from Al 2 v 7 ^) instead of Af^^) leads to a near-optimal mesh 
adaptation to the function /, under the constraint p(T) < C\ct for all triangles T in 
the triangulation. The following proposition implies the same in the case of piecewise 
quadratic finite elements. 

Proposition 3.4.2. Let n G B 3 and a > 1. Then y/2M 3 a \tt) G A n and 



det M { 3 \ix) < KM{detM ; M G A'^J 
where the constant K is independent 0/71 and a. 



(3.75) 



Proof: We already know that \/2Ai 



ir)eA' if M 



7T 



Al 3 (7r) then (3.75) holds 



as a consequence of (3.69) and Lemma 3.2.5 We therefore assume in the fol 
M { 3 a) {TT) ^ M 3 {tt). Let 



owing that 



A*(?r) := ||Vvr| 



L oo (D) = sup \Vir(z) 
kl<i 



where D = {2; G 1R 2 ; |z| < 1} is the unit disc of IR 2 . The largest ball inscribed in 

{z G R 2 ; |V7r(*)| < 1} is A*(7r)"2£). Let M G and let Ai > A 2 > be its 

_ 1 

eigenvalues. The ellipse {z G M 2 ; z T Mz < 1} contains the ball X 1 2 D, hence Ai > A*(7r). 
Furthermore A 2 > ct~ 2 Ai, hence 

det M = AiA 2 > a~ 2 A 2 > «^ 2 A*(tt) 2 . (3.76) 

Let A(7r) be the largest eigenvalue of Af3(7r), and assume that n = ax 2 +3bx 2 y+3cxy 2 +dy 3 . 
We obtain from (3.71) that 



A(tt) < ^TrM 3 (n) 2 = Va 2 + 3b 2 + 3c 2 + d 2 . 

Since the norms || Vtt\\l°°(d) an d y/a 2 + 36 2 + 3c 2 + d 2 are equivalent on the vector space 
H 3 , there exists a constant C > independent of 7r G H 3 such that \(ti) < Co\*(n). 
Since M. 3 a \7r) 7^ Afs(7r), the eigeinvalues of M. ( 3 a \ir) are X(tt) and a _2 A(7r). Hence 



det M^\ir) = cr 2 A(7r) 2 < C 2 a- 2 A*(7r) 2 . 



Combining this with (3.76) we conclude the proof, with K = C { 



Let us finally mention that, although they are derived from the coefficients of 7r, the 
maps 71 1 — y Al m (7r) and tt i— > Aim (*7r) for m G {2,3} are invariant under rotation, and 
therefore not tied to the chosen system of coordinate (x, y), as expressed by the following 
result. 



3.4. Optimal metrics for linear and quadratic elements 



139 



Proposition 3.4.3. For any m G {2,3} ; any n G M m and any unitary matrix U G O2, 
one has 

M m {7T O U) = U T M m {7l)U. 

Furthermore, for any a > 1 one has Aim (ft U) = U T Mt\n)U. 

Proof: We only prove the invariance under unitary transformation of Ai 3 , since the 
proof for M.2 is elementary, as well at the result for Aim ■ Let it G H 3 , let D x = [d x ir] 

and D y = [d y n]. Let U = ^ ^ n J be unitary, then 

[d x (n o [/)] = u u U T D x U + u l2 U T D y U and [^(vr o U)} = u 21 U T D X U + u 22 U i: D y U 
Hence 

+(wu«i2 + u 21 U2 2 )U T (D x D y + D y D x )U 
+ (u 2 12 + u 2 22)U T D 2 y U 

which equals U T D 2 U + U T D 2 U since U is unitary. Eventually 



M 3 (n o [/) = ^/U T D 2 X U + U T D 2 y U = U T ^D 2 X + D 2 y U = U T M 3 (n)U 
which concludes the proof. o 

3.4.4 Numerical results 

The envisionned applications for the theory developped in this chapter are mainly in 
the field of partial differential equations that exhibit "shocks", and strongly anisotropic 
features, in particular conservation laws and fluid dynamics. We therefore test the qua- 
lity of our meshes on a synthetic function that mimics the typical behavior of functions 
encountered in these contexts. For all 5 > 0, our test function /<$ : [—1, l] 2 — > R is defined 
as follows 

r 1 \ 1 ( % x ~ sin(5?/) \ o 2 
Js{x, y) := tanh I I + x + xy . 

In all numerical results, we choose 5 := 0.1. This function f$, although smooth, exhibits 
a "smoothed jump" of height 2 along to the curve defined by the equation 2x = sm(5y), 
on a layer of width S. On the rest of the domain, f§ is dominated by the polynomial part 
x 3 + xy 2 . The level lines and a 3D plot of fg are presented on the two rightmost pictures 



of Figure 3.2 



Our purpose is to produce four triangulations Tm,]p 1 , Tm,v 2 i 7l 2 ,Pi an d 7l 2 ,p 2 contai- 
ning 2000 triangles each and which, for this cardinality, produce respectilvely the smallest 
possible interpolation errors || V/ - V l l r /|| 2 , || V/- V l 2 T f\\ 2 , ||/-I^/|| 2 and ||/-I^/|| 2 - 
It is clearly out of reach to find the triangulations leading exactly to the smallest error. 
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sin(5 y) — 2 x 






Figure 3.2 - Description of the function f s , 5 = 0.1. 



Following the analysis developed in the beginning of this section we have generated Tffi ) p 1 
and Tm p 2 based on the metrics 



h H \wA z ) 



A^detM^wHM 100 ^ 

\ 2 (detM 3 (Tr z ))-*M 3 M 



T\ 7 



where 
where 



7T 2 



d 2 h(z) 

2 ' 
d 3 fs(z) 



(3.77) 



where the positive constants Ai,A 2 are adjusted in such way that the meshes generated 
have 2000 elements. Mesh generation was performe d by the open source program Free- 



FEM++ 



93 



and results are illustrated on Figure 



3.3 



Note that we have used M. 



(100) 



(defined as in (3.74)) instead of M. 2 which would lead to a different triangulation T^i p , 
also displayed on F gure |3.3[ and associated to the metric 



frfrSFiO*) := At(detA^ 2 (vr z )) *M 2 (ir z ) where w z 



d 2 fs(z) 



with again A^ adjusted to obtain 2000 elements. The use of M.^^ in place of M. 2 is 
justified by mesh generation issues which are discussed in the next subsection. 

Similarly, and following the study developed in Chapter [2j we have generated 7l2 iPi 
and Tl2 P2 from the metrics 



//i(det N 2 {ir z )) vj\f 2 (7r z ) where 
H2(detAf3(n z )y^J\f 3 (n z ) where 



K z 
71% 



d 2 fs(z) 
2 ' 



where again /^i,// 2 are positive constants adjusted in order to generate a mesh with 2000 
elements. Here N 2 {ir) '■= a/^M^O an d 

J\f 3 (ir) := argmin{detM ; M E S£ and \tt(z)\ < (z T Mz)^ for all z E R 2 }. 

We have obtained the following results, which confirm that the use of the metric adapted to 
a given norm and interpolation degree produces the triangulation that yields the smallest 
interpolation error in this case (at least among these four triangulations). 



#r = 2000 




Tffi.Fa 


7l 2 ,Pi 


7l 2 ,f 2 


1/5 - I7- fs Iffi 


1.35 


1.47 


1.43 


1.63 


Ws-lrfslHi 
10 2 ||/ 5 -I^/,|| L2 
10 4 ||/ 5 -I^/ 5 || L2 


1.66 


1.17 


1.89 


1.47 


1.54 


2.73 


0.759 


1.18 


6.64 


6.61 


4.73 


3.17 



(3.78) 
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Figure 3.3 - The meshes T^i p , Th-i i p 1 and Th- 1 ,p 2 adapted to fg (500 triangles only). 

3.4.5 Quality of a triangulation generated from a metric 

Given a metric h : £1 — > S 1 ^, there does not always exists a triangulation T adapted 



to h, i.e. satisfying (3.68) for some constant G\ > 1 not too large. Such a triangulation 



exists only if h satisfies some constraints which are analyzed in 66 , see also Chapter 5. 
Instead of analysing the metric h prior to the process of mesh generation, we choose here 
the simpler option of evaluating a posteriori the quality of a triangulation T. 



Since we are interested in the H 1 = W 1,2 semi norm we define following (3.33) 

S(T):= [^J^S{Tf 
For all T G T we define := /i(bary(T)) G S^- We also define the sets 



E := {in (\T\^/deth T ) ; T G t| and F := {p ^(^)) ! ^ e r} 



According to (3.68), the quality of T is reflected by the quantities 

exp(max£' — mini?) and maxF. 

However these quantities give a rather pessimistic account of the adaptation of T to h, 
and heuristically we find it more fruitful to consider averages. We therefore define 

1 



and 



p(T,h) :=^£p(7MT) 



<t(T, h) := exp 



#(£) 



E 



E 1 



The following table shows that the quantities S(T), p{T, h) and cx(T, /i) are abnormally 
large for the triangulation T^i Pi generated from the metric h * H± Pi but reasonable for the 
triangulations Th-i i p 1 and Th^,p 2 generated from the metrics (3.77). 



#r = 2000 








S(T) 


14.2 


3.14 


4.04 


P(T, h) 


10.6 


6.02 


4.18 


a(T,h) 


2.39 


2.25 


1.70 
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In practice T£i p led to a poor interpolation error, contrary to Tm,w> 1 - We believe 
that the poor quality of T^i P is due to the excessively wild behavior of the metric h* Hl Pi 
and not to a deficiency of the excellent mesh generator BAMG 93 



3.5 Polynomial equivalents of the shape function 



The optimal error estimates established in Theorem 3.1.1 involve the quantity L TO)P (— f). 



The function tt (->■ L miP (jr) is obtained by solving an optimization problem, and it does 
not have an explicit analytic expression in terms of the coefficients of tt G H m . In this 
section, we introduce quantities which are equivalent to L m (7r), and therefore to L m ^{ii) 
for all 1 < p < oo, and which can be written in analytic form in terms of the coefficients 
of 7T £ H m . 

Given a pair of non negative functions Q and R on M m we write Q ~ R if and only 
if there exists a constant C > such that C~ X Q < R < CQ uniformly on H m . We 
sometimes slightly abuse notations and write Q(tt) ~ R(jr). We say that a function Q is 
a polynomial on H m if there exists a polynomial P of m + 1 real variables such that for 
all ao, • ■ ■ ,(i m GK, 



Q fliX> m i = P(a , • • • , cim). 



We define degQ := degP, and we say that Q is homogeneous if P is homogeneous. For 
all m > 2, we shall build an homogeneous polynomial Q on H m such that 

L m ~ <A0\ wi th r := degQ, (3.79) 

where the constants in the equivalence only depend on m. 
We first introduce for all it g H m the set 



B n := {B G M 2 (R) ; \ir(z)\ < \Bz\ m for all z G M 2 } 

and the function 

K^(n) :=inf{|detP|^ ; B G B n }. 



According to Lemma 3.2.5 we have for any m > 2 



L m (7r) ~ y/K§^_ 2 (\Vic\*) (3.80) 

where |V7r| 2 = (d x Tr) 2 + (d y 7i) 2 G H 2m _2- The function is extensively studied in 
Chapter 2. In particular we know that 

Ki (tt) ~ v^H, (3-81) 

and 

ATf (tt) ~ </|disc(7r)| (3.82) 
where disc(7r) denotes the discriminant of a polynomial tt G H 3 , namely 

disc(ax 3 + bx 2 y + cxy 2 + dy 3 ) = b 2 c 2 - Aac 3 - Ab 3 d + 18abcd - 27a 2 d 2 . 
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More generally, it is proved in Chapter [2] that for all m > 2, the function has an 
equivalent of the form where Q is an homogeneous polynomial of degree r on M m . 



Combining this result with (3.80) we obtain the main result of this section. 



Proposition 3.5.1. Let m > 2 and let Q be an homogeneous polynomial on M^m-i such 
that KS 



2m-2 ~ v \Q\> where r = degQ. Let be the polynomial on H m defined by 

Q*(tt) := Q(|Vtt| 2 ). 

Then L m ~ y/Ql 071 ^m- 

\2[n]z\ 2 = Az t [tt] 2 z. Using (ggjt we 



Let 7r G H 2 and let us observe that |V7r(z)| 2 
therefore obtain 



Lo(tt) 



K$(\V 



7l\ 



^/v / det(4[ 7 r] 2 ) =2v/|det[7r]|. 



(3.83) 



The construction suggested by Theorem 3.5.1 uses an equivalent of -Kf m _ 2 to produce 



an equivalent to L m . Unfortunately, as m increases, the practical construction of Q such 
that y \Q\ is equivalent to becomes more involved and the degree r quickly raises. In 
the following theorem, we build an equivalent to L m from an equivalent of instead 



of K^ m _ 2 , which is therefore simpler. 



Theorem 3.5.2. Let m > 3 and let Q be an homogeneous polynomial on IH m ^i such that 
Km-i ~ vlOl; where r = degQ. Let (Qk)o<k<r be the homogeneous polynomials of degree 
r on if/m-i x H m -x such that for all and all 7i"i, 7r 2 G M m we have 



Q(wr 1 + wr 2 ) = (Ip*^ k Qk(n,^2), 

0<k<r ^ ' 



(3.84) 



where (I) :- 



k\{r-k)\ 



. Let be the polynomial defined for all tt G M m by 



0<k<r ^ ' 



TV 



then L r 



fQl on H„ 



Proof: See Appendix. 



o 



Using this construction and (3.81) we obtain an equivalent of L3 as follows. Let tt\ 
ax 2 + 2bxy + cy 2 and 7r 2 = a'x 2 + 2b' xy + c'y 2 be two elements of H 2 . We obtain 

det([wri + V7T 2 }) = (ua + va'){uc + vc') — (ub + vb') 2 

= u 2 (ac — b 2 ) + uv(ac + a'c — 2bb') + v 2 (a'c — b' 2 ). 



Applying the construction of Theorem 3.5.2 to tt = ax 3 + 3bx 2 y + 3cxy 2 + dy 3 G H 3 we 

obtain 

L 3 (tt) ~ 3i/(ac - b 2 ) 2 + (ad - bc) 2 /2 + (bd - c 2 ) 2 . (3.85) 
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Remarking that 

2[(ac - 6 2 ) 2 + {ad - bc) 2 /2 + (bd - c 2 ) 2 ] 



(a 2 + lb 2 + c 2 ) (6 2 + 2c 2 + d 2 ) - (ab + 26c + cdf 



and using equation (3.72) we obtain that L^tt) ~ a/ det M.^(ji). This point was announ- 



ced in 3.4.1 and establishes that the map A4 3 defined in (3.71) can be used for optimal 



mesh adaptation for quadratic finite elements. 



Using (3.82) and the construction of Theorem 3.5.2, we also obtain an equivalent of L^ir 



l 4 M 





(36 2 c 2 


- 4ac 3 - 


46 3 d + 6abcd - a 2 d 2 ) 2 


+ 


(26c 3 - 


- 6ac 2 d H 


- Aabd 2 - 46 3 e + 6a6ce - 2a 2 rfe) 2 /4 


+ 


(3c 4 - 


66c 2 d + 


8b 2 d 2 - Qacd 2 - Qb 2 ce + Qac 2 e + 2abde - a 2 e 2 ) 2 /Q 


+ 


(2c 3 d - 


- 4a<i 3 - 


66c 2 e + 4b 2 de + Qacde - 2abe 2 ) 2 /4 


+ 


(3c 2 d 2 


- 4bd 3 - 


-4c 3 e + 6bcde-b 2 e 2 ) 2 . 



The following proposition identifies the polynomials 7r G IH m for which L m (7r) = 0, 
and therefore the values of d m f for which anisotropic mesh adaptation may lead to super- 
convergence. 

Proposition 3.5.3. Let m > 2 and let t m := |_ ZI ^J . Then for all it G M m , 

L m (7r) = if and only if n = (ax + (3y) tm Tr for some a, fl G R and 7r G M m ^ tm . (3.86) 

Proof: According to ( |3.80j ), L m (ix) = if and only if ATf m _ 2 (| Vvr| 2 ) = 0. On the other 
hand, it is proved in Chapter [2] that for any 7r* G H 2m -2 one has i^fm-^ 7 **) = if and 
only if 7r* has a linear factor of multiplicity m. Therefore L m (vr) = if and only |V7r| 2 is 
a multiple of l m , where / is of the form / = ax + fly. 

Let us first assume that |V7r| 2 = (d x 7i) 2 + (d y ir) 2 has such a form. Since they are non- 
negative, (d x Tr) 2 and (d y Tr) 2 are both multiples of l m . Therefore d x ir and <9 y 7r are multiples 
of I s where s is an integer such that 2s > m, hence s > t m — 1. We therefore have 

d x ir = l s 7Ti and d y ir = / s 7r 2 where 7Ti, 7t 2 G H m _i_ s 

Recalling that I = ax + fly we obtain 

= d 2 x n - d 2 xy n = l B (d v m - d x 7T 2 ) + sl s -\fl^ - mr 2 ), 

hence fliri — a7r 2 is a multiple of I. Since 7r is homogenous of degree m it obeys the Euler 
identity mir(z) = (z,Vn(z)) for all z G M. 2 . Assuming without loss of generality that 
a / 0, we therefore obtain 

m7c(x, y) = I s (xtti + yTT 2 ) = I s ((ax + fly)— + — («7r 2 — fliti) ] 

V a a / 

which shows that it is a multiple of / s+1 , hence of l tm . 

Conversely if tt is a multiple of l tm then d x ir and d y it are both multiples of i* m_1 . Since 
2(t m — 1) > m the polynomial |V7r| 2 is a multiple of l m which concludes the proof. o 
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3.6 Extension to higher dimension 



This section partially extends the results exposed in the previous sections to functions 

and of the 



of d variables. We give in £3.6.1 
measure of sliverness S. 



the generalisations of the shape function L 



m,p 



Subsection £3.6.2 is devoted to interpolation error estimates. We prove a local d- 



dimensional error estimate in Theorem 3.6.6 which generalises Theorem 3.2.6 We then 



establish an asymptotic lower error estimate in Theorem 3.6.7 which generalises Theorem 
3.1.2 We give sufficient conditions under which the interpolation on a <i-dimensional 



mesh T achieves this optimal lower bound up to a multiplicative constant. However due 
to technical issues linked to the measure of sliverness S we were not able to construct 
such meshes, and we therefore state the upper bound as a conjecture. 



We discuss in subsection £3.6.3 the construction of optimal metrics for practical mesh 



generation. We partially extend the results of £3.4 and raise open questions. 



3.6.1 Generalisation of the shape function and of the measure 
of sliverness. 

We extend in this section the tools used in our analysis of optimally adapted triangu- 
lations to arbitrary dimension d. We begin with the spaces of polynomials. Let 

H mi d := Spanjx" 1 • • • x^ d ; \oz\ — m} and P m ,d '■= Spanja;" 1 • • • x^ d ; \a\ < m}, 

where a = (a%, • • • , a^) denotes a d-plet of non- negative integers, |a| := a% H h a^. For 

any simplex T the Lagrange interpolation operator I™ : C°(T) — > P m .d is defined by impo- 
sing 7(7) = Iy /(t) for all points 7 with barycentric coordinates in the set {0, ■ , 1} 
with respect to the vertices of T. For all n e H m d we define 

L m ,dA n ) '■= inf llVvr - VI™ -1 7r|| iP(T ), 
| T | =1 

where the infimum is taken on the set of <i-dimensional simplices of unit volume. Similarly 
to (3.10) the functions L m ^ py 1 < p < 00, are uniformly equivalent on M md . We define 



The distance defined at (3.14) between triangles extends easily to simplices. Given two 
ci-dimensional simplices T, T' there are precisely (d + 1)! affine transformations ^ such 
that $f(T) = T' . For each such we denote by ip its linear part and we define 

d{T,T') := lnAnf{«;(^) ; *(T) = T'} 

We say that a <i-dimensional simplex T is acute if the exterior normals n, n' to any two 
distinct faces F, F' of T have a negative scalar product (n, n'). In other words if all pairs 
of faces of T form acute dihedral angles. We denote the set of acute simplices by A and 
we generalise the measure of sliverness to arbitrary dimension d as follows 



S(T) := expd(T, A) = inf{ K (^) ; *(T) G A}. 



(3.87) 
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Similarly to (3.15), the quantity S(T) reflects the distance from a simplex T to the set 
of acute simplexes A. The definition (3.87) of S(T) raises a legitimate question : how 
to produce an afhne transformation \1/ such that \1>(T) has acute angles, and is 
comparable to S(T) ? This question is answered by the following proposition. 

For any d-dimensional simplex T with vertices (vi)o<i< d , we define the symmetric 
matrix 

Mt := / e n e Tii where e« := -r~ ~\- 

0<i<j<d 1 

Observe that 



v .i\ 



1 < || >/Mt\\ = V\\ M t\\ < ol a where a d := \j d ( d + l \ ( 3 . 89 ) 

since a\ is the number of distinct pairs (i, j) satisfying < i < j < d. 

_ i 

Proposition 3.6.1. For any simplex T, the simplex M T 2 (T) is acute and 

S(T) < K (y/Mr) < a d S(T). (3.90) 
Proof: See appendix. o 



Remark 3.6.2. In the paper '621 an alternative measure of sliverness S'(T) of a simplex 
T is introduced, and defined as 

S'(T) := I inf max \(u, ei ,-)| I 

\\u\=l i<j ' J 

This quantity is equivalent to S(T) . Indeed, for any u G W 1 we have 

-- ^u T M T u 



max | [u, eij 

i<3 




< a d max\(u,eij}\ } 



which implies that a^ 1 S"(T) < \\M T 2 \\ < S'(T), hence a^SiT) < S'(T) < a d S(T) using 
(3.89). Our approach therefore introduces a new geometrical interpretation to the quantity 
S' introduced in [62\ as the distance from a given simplex to the set of acute simplices. 



The following lemma generalises Lemma 3.2.2 and shows that the interpolation process 
is stable in the L°° norm of the gradient if the measure of sliverness is controlled. Let us 
mention that a slightly different version of this lemma can be found in 62 , yet not exactly 
adapted to our purposes. 

Lemma 3.6.3. For all m > 2 and all d > 2 there exists a constant C = C(m,d) such 
that for any d-dimensional simplex T and any f G W 1,00 (T), one has 



|VI^/|U~ (T )<CS(T)||V/|Uoo (r) . 



(3.9i; 
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Proof: The proof this lemma is extremely similar to the proof of Lemma (3.2.2). Let To 
be the simplex which vertices are the origin and the canonical basis of R . For the same 



reason as in Lemma 3.2.2, if a function g(x\, x%, • • • ,Xd) £ C°(T ) does not depend on 
the coordinate Xd, then I^ -1 g does not depend on x d either. Using the same reasonning 
as in Lemma 3.2.2 we obtain that there exists a constant Cq = Co(m,d) such that for all 
g £ W^iTo) 



dv? g 



dx,i 



<C 



L°°(T ) 



dg 



dx t , 



L°°(T ) 



Again similarly to the proof of Lemma |3.2.2| we obtain using a change of variables that 
for any simplex T, any / £ W 1,00 (T) and any edge vector u of T 

\\(u,Vl^f)\\ L ^ {T) <C G \\(u,Vf)\\ L ^ T) . 



We use the notations of Proposition 3.6.1 and we define a norm \v\t on R by 

\2 



\V 



2 T :=v T M T v= ( w > e *i>' 

0<i<j<d 



Observe that 



i^r'ir 1 



v\ < \v\ T < ||Af||||v|. 



(3.92) 



Then, since is proportional to an edge vector of T, 

II 1^ I™ /|t||l°o( T ) < ^2 II ( e *i' ^ f)\W°°(T) 

0<i<j<d 

< c IK^>W>II L°°(T) 
o<«<j<rf 



Combining this result with (3.92) we obtain 



\M T 2 \ 



IVI^/IUoc^^Co^IIMIIHIV/IUo 



•(T) 



and we conclude the proof using (3.90) 



o 



The oscillation of the gradient of the interpolated function is an important problem en- 
countered by numerical methods that try to take advantage of highly anisotropic meshes, 



see the discussion in 85 . As the previous lemma shows, such oscillations are kept under 



control if S(T) is bounded on the mesh of interest. For checking this property in prati- 
cal situations one needs an equivalent of the sliverness S that can be computed at low 



numerical cost. The formula (3.87) is clearly not adapted, since it involves a complicated 



optimisation procedure. Instead we propose to use 



S(T) := 

Observing that \\M^\\ < S(T) < \fd\\M~ 
obtain 



Tr(M T 1 ). (3.93) 
, and recalling that 1 < \\Mt\\ < «d we 



a^SiT) < S(T) < Vda d S(T). 
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Figure 3.4 - Examples of Good anisotropy (Thin lines, S(T) ~ 1), and Bad anisotropy 
(Thick lines, S(T) > 1) . 

Note that S(T) has an analytic expression in terms of the coordinates of T : the square 
root of the ratio of two polynomials in the positions of the vertices of T. 



Remark 3.6.4. We illustrate the sharpness of inequality (3.91) in a simple example. Let 
x,y,z be the coordinates on M 3 and let ttq '■= x 2 £ i^2,3- Let T\ be the tetrahedron of 
vertices (—A, 0,0), (A, 0,0), (A, 1,0) and (0,0,1). Simple computations show that 

S (T ) /5~ 

||VIr A 7To||l°°(t a ) = A 2 , ||V7t ||l°°(t a ) = 2A and Hm ^ A = J -. 

Let T' x be defined by replacing the vertex (—A, 0,0) of T\ with (0,0,0). Then 

3 

||VIr A 7r ||z,oo(T') = A, ||V7To||l°°(t a ) = 2A and lim S(T' X ) = -. 

Hence the simplices T\ and T' x have very different interpolation properties for large A, 
although they have a similar aspect ratio. They are representatives of "bad" and "good" 

anisotropy respectively. The tetrahedrons Ta and T3 are illustrated on the left of Figure 

2 2 

4, bottom and top respectively. 

For any <i-dimensional simplex T, we define its measure of non degeneracy by 

diam(T) a! 



P(T) 



\T\ 



Let T* be a fixed (i-dimensional acute simplex, for instance the reference equilateral sim- 
plex. For any (i-dimensional simplex T let ip £ GL^ and z £ lR d be such that T = z+ij)(T*). 
Since T* is acute, we obtain a generalization of ( |3.27| ) 

S(T) < «#) < U\\ d \ det^l" 1 < d ^[/ ^ = C(d)p(T). 

where /i(T*) is the diameter of the largest ball incribed in T*, and where we have used the 
inequality | det(-0 —1 ) | > IIV'II - ^ - ■ This last inequality can be derived by using the 

singular value decomposition ip = U diag(Ai, • • • , \d)V with < Ai < • • • < A^ and noting 
that H^ll = \ d and = A^ 1 . 
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Generalizing (3.20), we define for all 7r G H md , 

A := {A G Ma(R) ; |Vtt(z)| < lAz]" 1 " 1 for all 2 G 

Geometrically, one has A G ^ if and only if the ellipsoid {z G M d ; \Az\ < 1} is included 
in the algebraic set {z G M d ; |V7r(z)| < 1}. This leads us to the generalisation of Lemma 
1X231 

Lemma 3.6.5. For all m > 2 and all d > 2 there exists a constant C = C(m,d) such 
that for all n G JH m ,d, we have 

C^L^i-ir) < inf{|det Af? ; A G A^} < CL m4 (n). 
Proof: The proof of this lemma is completely similar to the proof of its bidimensional 



version Lemma 3.2.5 The only point that needs to be properly generalized is the following : 
given a matrix A G GL^, how to construct an acute simplex T = T(A) such that p(A(T)) 
is bounded independently of A ? 

The following construction is not the simplest but will be useful in our subsequent 
analysis. Let A = U DV, be the singular value decomposition of A, where U, V are ortho- 
gonal matrices and D is a diagonal matrix with positive diagonal entries (\i)i<i<d- We 
define the Kuhn simplex To 

T := {x G [0, l} d ; x x > x 2 > ■ ■ ■ > x d }, 

and T := V T D- 1 T . Then p(A(T)) = p(U(T )) = p(T ) = d\d d l 2 which is independent of 
A. We now show that T is an acute simplex. 

Let (ei, • • • , ed) be the canonical basis of M d , and let by convention eo = ej+i = 0. For 
< i < d, an easy computation shows that the the exterior normal to the face Fi of To, 
opposite to the vertex v j = J2 0<k<i £k, is rii = ^z^t^ti ■ It follows that the exterior normal 
n't to the face D~ 1 (F i ) of the simplex T> -1 (T ) is 

, D(ni) Xtei - A l+ ie i+ i 



\D(ni)\ |Aiej-A l+ ie 



Hence (n'^n'j) — if \i — j\ > 1, and (n^,n^ +1 ) < for all < % < d — 1. It follows that 
the simplex D _1 (T ) is acute, and therefore T = V T D~ 1 (T ) is also acute since V is a 
rotation. o 



3.6.2 Generalisation of the error estimates 

We present in this section the generalisation to higher dimension of our anisotropic 
error estimates. We prove a local error estimate in theorem 3.6.6 and an asymptotic lower 
estimate in |3.6.7| We also point out in conjecture |3. 6. 8| a technical point which, if proved, 
would lead to the optimal asymptotic upper estimates (3.98) and (3.99). 
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Theorem 3.6.6. For all m > 2 and d>2 there exists a constant C = C(m,d) such that 
for all 7r G M m a, all A G A n and any simplex T we have 



i 



m— 1 



where - :- 



\n - I™' 1 ir\ w i, P(T) < C\T\^ S(T)p(A(T))^\detA\^r 
m J^ + -. Furthermore for any g G C m {T) we have 

rm-l 



m — 1 



\g - g\ w ,, P{T) < C\T\^S(T)p(T) — \\d m g\\ L 



>(T)- 



Proof: It is a straightforward generalization of the proof of Theorem 3.2.6 



(3.94) 



Combining these two estimates, we can obtain a mixed estimate similar to (3.30), with 



the new value of r and the generalised S and p. For all m > 2 and d > 2 there exists a 
constant C = C(m, d) such that for any simplex T, any / G C m (T), any tt G H m and any 
A G Ar 



< C|T|^(T) (p{A{T))^\detA\^+p{T)^\\d m f-d m >K\\ L o 0{T) 



(3.95) 



This leads us to a straightforward generalisation of the points (i) to (iv) exposed in (3.31 ) 



Similarly to the bidimensional case (3.37) if a triangulation T meets these requirements, 
then it satisfies the error estimate 



#(T) — Y f\vn,P {a )<C\\L r 



7T 2 



lL T (f2) • 



(3.96) 



Generalizing (3.6), we say that a sequence (Tn)n>n of simplicial meshes of a d- 



dimensional polygonal domain is admissible if #(7jv) — N and if there exists a constant 
> such that 

sup diam(T) < C A N~ l d. 



TeTjv 



Similarly to (3.7), it can be shown that (3.96) cannot be improved for an admissible 



sequence of triangulations, in the following asymptotical sense. 

Theorem 3.6.7. Let (Tn)n>n be an admissible sequence of triangulations of a domain 
tt, let f G C m {Vt) and 1 < p < oo. Then 



\imini N—\f f\ w i, P{a) > 



L 



m,d,p 



d m f 



(3.97) 



L T (Cl) 



where - :- 



m—l I 1 
d "T" p- 



Proof: It is identical to the proof of the bidimensional estimate (3.7), which is exposed 
in C3Tl 



In contrast, the upper estimates (3.4) and (3.8) do not generalize easily to higher 



dimension. A first problem is that the bidimensional mesh Vt defined in Equation (3.49) 



has no equivalent in higher dimension, in the sense that we cannot exactly tile the space by 
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simplices of optimal shape. We may however build a tiling made of near optimal simplices, 
based on the following procedure : for any permutation a G of {1, • • • , d} we define 

T a := {x G [0, l} d ; x am >■■■> x a ^}. 

Let A G GLd(R), and let A = UDV be the singular value decomposition of A, where U 
and V are unitary and D is diagonal. We define 



V 



A ■- 



{V T D~\T a + z); aeZ d , ze7Z d }, 



which is a tiling of M. d built of acute simplices T satisfying p(A(T)) = d\d d / 2 (these 



properties are established in the proof of Lemma 3.6.3). Using such a tiling, we would 



like to build partitions Vau(R) °f an Y rf-dimensional simplex i?, with properties similar 



to those expressed in Lemma 3.3.2 for the triangulations Vtu(R)- At the present stage we 



do not know how to properly adapt the construction of Va,u(R) near the boundary of R 
in order to respect the condition on the measure of sliverness. The following conjecture, 



if established, would serve as a generalisation of Lemma 3.3.2 



Conjecture 3.6.8. Let R be a d-dimensional simplex, and let A G GL^(M). There exists 
a sequence (VA,n{R))N>a, of conformal triangulations of R such that 

- Nearly all the elements of TZn belong to Vau '■= -'Pa, in the sense that 



lim 



MKJR)) d\\R\ 



rr 



detA\ 



and lim ; = U. 



n" 



where 



KJR) ■= VaAR) n V A , n and V% n (R) := V A , n (R) \ Pa,u 

- The restriction ofV A , n (R) to a face F of R is its standard periodic tiling with n d ~ x 
elements. 

- The sequence (VA,n(R))n>o satisfies 

sup I n max diam(T) I < oo and sup max S(T) < oo. 

n>0 V T ^A,n(R) ) n>0 T€PaA r ) 

The validity of this conjecture would imply the following result using the same proof 



as for the estimates (3.4) and (3.8) established in ^3.3.1 



Conjecture 3.6.9. For all m > 2 there exists a constant C = C(m,d) such that the 
following holds. Let Q C M. d be polygonal domain, let f G C m (Q) and 1 < p < oo. Then 
there exists a sequence (Tn)n>n of simplicial meshes of fl such that #(7jv) < N and 



lim sup iV" 

iV->oo 



'I/- It/ /I 



< C 



L 



m,d 



d m f 



ml 



(3.98) 



L T {VL) 



m— 1 



where - 

T 

simplicial meshes (7J 



Furthermore, for all e > 0, there exists an admissible sequence of 
n)n>n of Q such that #(TJv) < iV and 

d m f 



lim sup iV— |/ - CrT f\ w i* m < C 



L 



m,d 



ml 



+ £. 



(3.99) 



L T (f2) 
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3.6.3 Optimal metrics and algebraic expressions of the shape 
function. 

The theory of anisotropic mesh generation in dimension three or higher is only at 



its infancy. However efficient software already exists such as 94 for tetrahedral mesh 
generation in domains of 1R 3 . A description of such an algorithm can be found in 16 



as 



well as some applications to computational mechanics. These software take as input a field 



h(z) of symmetric positive definite matrices and attempt to create a mesh satisfying (3.68 ). 
Defin ing the set of symmetric matrices A! n in a similar way as in the two dimensional case 
( |3.65 ), let us consider a continuous function M.^ d : H m( 2 — > S d such that 



AA(tt) G A: and detAO) < K(M{detM ; M G A' n } + e), 



(3.100) 



where e > can be chosen arbitrarily small and where K is an absolute constant 
independent of e. The existence of such a function is established in Chapter 6. Let 
M(z) := M^ d (d m f(z)), let 5 > and let 



h(z) :=6~ T (det M(z))^M(z) 



(3.101) 



A heuristic analysis similar to the one developed in §3.4 suggests that mesh generation 
based on this metric leads to a mesh T of Q optimally adapted for approximating / 
with P m _i elements in the W l,p semi norm. This justifies the search for functions Ai md 



satisfying (3.100) 



The form of Ai 2 ,d, which corresponds to piecewise linear finite elements, is already 



established, see for instance 85 , but we recall it for completeness. The same analysis as 



in £3.4.2 shows that 



M 2 , d {*) : = 4[tt] 2 

satisfies A^dM G A' n and det A^2,d( 7r ) = infjdetM ; M E A' w }. As a byproduct we 



obtain from Lemma 3.6.5 that there exists a constant C = C(d) such that for all n G IH^d 



C-^ldetH < L 2id (n) < C^detM. 



For piecewise quadratic elements, we generalise (3.71) and define 



MUn) := yfcp + ■ ■ ■ + [d Xd n] 



Then y/dAi^ d (n) G A'^, but we have found that for each K > there exists n G H 3 ^ 
such that 

det M* 3d (ir) > irinf{detM ; M G A'J. 



The map M.% d may still be used for mesh adaptation through the formula (3.101) but 

this metric may not be optimal in the area where 7r 2 = d ^ is such that det d {jt z ) is 
not well controlled by inf {det M ; M G A' n }. 
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3.7 Final remarks and conclusion 



In this chapter, we have introduced asymptotic estimates for the finite element inter- 
polation error measured in the W l,p semi- norm, when the mesh is optimally adapted to a 
function of two variables and the degree of interpolation m — 1 is arbitrary. The approach 
used is an adaptation of the ideas developped in Chapter [2] for the LP interpolation error, 



and leads to asymptotically sharp error estimates, exposed in Theorems 3.1.1 and 3.1.2 
These estimates involve a shape function L m<p which generalises the determinant which 
appears in estimates for piecewise linear interpolation. The shape function has equivalents 



of polynomial form for all values of m, as established in theorems |3.5.1| and |3.5.2| Up to a 
fixed multiplicative constant, our estimates can therefore be written under analytic form 
in terms of the derivatives of the function to be approximated. 

In the case of piecewise linear and piecewise quadratic finite elements, we have pre- 



sented in £3.4 metrics which allow to produce near optimal meshes. This metric is new 



in the case of quadratic elements. Some numerical experiments presented in §3.4.2| illus- 
trate the efficiency of this procedure, and the C++ source code is freely available on the 
internet 



95 



We have partially extended these results to higher dimension, in particular we pro- 



vide a local error estimate (3.95) which leads to sufficient conditions for building meshes 



that satisfy the best possible estimate up to a multiplicative constant. A multidimensional 



asymptotical lower error estimate is proved in Theorem 3.6.7 and generalises the bidimen- 



sional study. The corresponding asymptotical upper estimate is presented in |3.6.9| but not 
proved. 

One of the main tools used throughout this chapter for the construction of an optimal 



partition is the measure of sliverness S(T) of a simplex, defined in (3.87), which has 



a geometrical interpretation as the distance from T to the set of acute simplices. This 
measure accurately distinguishes between good anisotropy, that leads to optimal error 
estimates, and bad anisotropy that leads to oscillation of the gradient of the interpolated 
function. Equivalent quantities can be found in 
context of optimal mesh adaptation. 



10 62 , but had not been used in the 



3.8 Appendix 



3.8.1 Proof of Lemma 3.3.2 



Let R n be the homothetic contraction of R by the factor 1 — n 1 and with the same 
barycenter. We define a partition V' n of R n into convex polygons as follows V' n := {R n H 



j>t . rpt e V Tn }. The triangles R, R n , and the partition V' n are illustrated on Figure 3.5 
Note that the normals to the faces of the polygons building the partition V' n belong to a 
family of only 6 elements (nj)i<j<6 : the normals to the faces of of R, and the normals 
to the faces of T. Hence only 6x5 different angles can appear in V' n , and we denote the 
largest of these by a < 7T. 

We now partition into triangles each convex polygon C G V' n using the Delaunay 
triangulation of its vertices. Note that the angles of the triangles partitionning a convex 
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Figure 3.5 - Left : The triangles R and R n . Right : The partition V' n of R. 



polygon C are smaller than the maximal angle of C, hence than a. We denote by the 



resulting triangulation of R n , as illustrated on the left of Figure 3.6 



We denote by E n the collection of n equidistributed points on each edge of R, described 



in item 2 of Lemma 3.3.2 We denote by E' n the set of vertices of the triangles in "P" that 
fall on dR' n . For each point p G E n , we draw an edge between p and the point of E' n 
which is the closest to p. This produces a partition of R \ R n into triangles and convex 
quadrilaterals. Eventually we partition each of these polygons C into triangles using the 
Delaunay triangulation of the point set C n (E n U E' n ), which produces a triangulation V n 



of R \ R n illustrated on the right of Figure 3.6 The triangles X" G V n obey 

diam(T') < (2 diami? + diamT)^ 1 = CrT 1 . 

Furthermore let L be the length of the edge of T' included in dR U dR n , and let H be the 
height of the triangle T" such that LH = 2\T'\. Then 

H > mm{\z - z'\ ; z G dR, z' G dR n } = en' 1 . 

where c > is independent of n. Let V be another edge of T", and let 9 be the angle of 
T' between the edges L and V . Then 



2\T'\ = LL' sin 9 = LH. 

herefore arcsin(^) < 9 < 
the angles of T' are smaller than it — arcsin(^). We eventually define Vt,u{R) '■= U V n 



hence sin 9 > diar ^ r /) > ^, and therefore arcsin(^) < 9 < n — arcsin(^). It follows that all 



Figure 3.6 - Left : detail of the partition of R n . Right : the partition V n = V T)Tl (R) = 
U V n of R. 
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and we observe that the largest angle of a triangle in Vr,n(R) is bounded by the constant 
(3(R, T) = max{a, 7r — arcsin(^)} < 7T which is independent of n. Hence 



sup sup S(T) < tan 

ri>i Tev T ,n(R) 



f3(R,T) 



< oo 



The other properties of Vt,u(R) mentionned in 3.3.2 are easily checked. 



3.8.2 Proof of Theorem 13X2 

Let m > 2 be arbitrary and let s m 



LyJ + 1. We have proved in chapter £j2j 



Proposition 2.2.1, that for all n G H, m the three following properties are equivalent 



K £ m (n) = 0, 

There exists a, G R and n G H m _ Sm such that n = (ax + (3y) Sm 7i, 
There exists a sequence (0 n )n>o, 0n G SL 2 such that 7r o ra — y 0. 



(3.102) 



We also proved in chapter £}2j Theorem 2.6.3, the following invariance property : Let Q 
be a polynomial on H m such that ~ where r = degQ. Then 



Q(tt o 0) = (det0) r ?Q(7r) for all vr G IH m and G M 2 (I 



(3.103) 



It immediately follows that the polynomials (Qk)o<k<r, defined in (3.84), satisfy for all 
7Ti, 7r 2 G H m and G M 2 (R) 



Q fc (7Ti o 0, 7r 2 o 0) = (det 0) 2 Qfc(7Ti, 7r 2 ) for all 7Ti, 7r 2 G H m and all G M 2 (R). (3.104) 
We define two functions on H m x H m 



K*(7Ti,7T 2 ) := 



2 *7 ( 5fc( 7r i' 7r 2) 2 and K(7Ti,7r 2 ) := 

0<fc<r 



2J- 



where Q is such that iT| m ~ a/ Q, r := degQ. We show below that if ~ i£* on H m x H m . 



This result combined with (3.80) concludes the proof of Theorem 3.5.2[ (Note that m is 



replaced with m + 1 in the statement of this theorem.) Using (3.104) and re marking the 
invariance property Q(no(j)) = (det 4>) rm Q (it), for the same reasons as (3.103), we obtain 



for all 7Ti, 7r 2 G H m and all G M 2 (R), 



= | det 0| ^K(-ki,7T 2 ), 
= | det0|^iT*(7ri,7r 2 ). 

(3.105) 

If K(ir 1 , 7r 2 ) = 0, then nf + -k\ G H 2m has a linear factor of multiplicity s 2m = m + 1 



K(7Tl O 0, 7T 2 o 
^*(7Ti O 0, 7T 2 O 



according to (3.102), and therefore ~K\ and 7r 2 have a common linear factor of multiplicity 



If i^*(7Ti,7r 2 ) = 0, then for all k, < < r, we have <3/c(ti"i, vt 2 ) = 0. Using (3.84) we 
obtain that for all u,v G R we have K m (wri + t>7r 2 ) = 0. It follows from (3.102) that for 



all u,v G R the polynomial wn\ + t>7r 2 G IH m has a linear factor of multiplicity s m , hence 
that 7Ti and 7r 2 have a common linear factor of multiplicity s m . 
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Hence the following properties are equivalent 



#(71-1,712) =0, 

if*(7ri,7r 2 ) = 0, 

There exists a, j3 G R and 7fi, 7r 2 G H m _ Sm 

such that 7Ti = (ax + (3y) Sm ni, and 7r 2 = (ax + (3y) Sm TT 2 . 



(3.106) 



Using (3.102) we find that these properties are also equivalent to 

*LM + *!) = °. 

There exists a sequence (4> n )n>o, 4>n G SL 2 , such that (7Ti o </> n ) 2 + (7r 2 o n ) 2 — > 0, 
There exists a sequence (0 n )n>o, 0n G SL 2 , such that tti o (p n — >■ and tt 2 o — >■ 0. 

(3.107) 

We now define the norm ||(7Ti, 7r 2 )|| := supu^j \(tti(u), 7r 2 (u))| on H m x H m and 

7":= {(7Ti,7T 2 ) G M m x H m ; ||(tti, tt 2 )|| = 1 and ||(tti o (f>, tt 2 o <j>)\\ > 1 for all <p G SL 2 }. 

J 7 is compact subset of H m x H m and K as well as do not vanish on J 7 according to 
(3.106) and (3.107). Since these functions are continuous, there exists a constant Co > 
such that 

C Q l K < K* < C K on T. (3.108) 

Let 7ri, 7r 2 G H m . If there exists a sequence (0 n )n>o, 0n G SL 2 , such that n\ o n — >• and 
7i"2 ° 0n — >■ 0, then /r(7ri,7T 2 ) = ^(tti, 7r 2 ) = 0. Otherwise, consider a sequence ((p n )n>o, 
4> n G SL 2 such that 



lim ||(7ri ° <Pn,^2 O 



inf ||(7Ti O 0, 7T 2 O 

</>eSL2 



By compactness there exists a pair (ffi, 7r 2 ) G H m x H m and a subsequence 
that 



-> (7ri,vr 2 J. 



One easily checks that g J 7 . Using (3.105) we obtain 



-#(7Tl,7r 2 ) _ #(7Ti O n , 7T 2 O W ) 

#*(7ri, 7T 2 ) n^oo ^(711 O n , 7T 2 O </y 



if(7Ti,7r 2 ) 

X*(7Ti,7f 2 ) 



such 



Using (3.108) and the homogeneity of K and if*, we obtain that C 1 # < < CqK on 
H m x H m which concludes the proof. 



3.8.3 Proof of Proposition 3.6.1 



We denote by T eq a rf-dimensional equilateral simplex such that bary(T eq ) = 0, where 
bary denotes the barycenter, and such that its vertices < i < d, belong to the unit 
sphere, i.e. \qt\ = 1. Since the vertices of T eq play symmetrical roles there exists a constant 
£ G R such that 

For all < i < d, < j < d, < k < d, one has — qj, q^) = £(^fc — Sjk), (3.109) 
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where 5 is the Kronecker symbol : 5y = 1 if i = j, and otherwise. Using the relation 
Qo + " " " + Qd — we obtain £d = Yl*j=o(Qo ~ Qji Qo) = d + 1 hence £ = 1 + |. Note also that 
the unit exterior normal to the face of T eq opposite to the vertex is — g$. 

We recall the following property : if A G GL^ and if n is the exterior normal to a face 
T of a simplex T, then the exterior normal to the face A(F) of A(T) is 

(y4 _1 ) T n 

(3.110) 



n 



-n 



We first establish that for any simplex T, the simplex M T 2 (T) is acute. Without loss 
of generality we can assume that bary(T) = 0, hence there exists A G GL^ such that 
T = A (T eq ). Since the vertices of T are V{ = Aqi for < i < d, we obtain from definition 
(J3~88j) that 

(?< - Qj)(Qi ~ QjV 



M 7 




\A( qi - qj )\' 



A 1 



According to (3.110), the exterior normal to the face of the simplex T ac := M T 2 (T) 
_ i 

M T 2 y4(T eq ) opposite to the vertex Vi is 

in = -UiM^A^fqi 



where > 0. For all < a < b < d, we therefore obtain using (3.109|) 
u a u b (n a ,n b ) = (M^A- 1 )^, m|(A- ] > j 



Qb 



1\T 



Qb 



g a T A" 1 M T (A- 

(<5oi - #aj)(<5fti - hj) 



Qb 



o<«<i<rf 
\A(q a -q b W 



\Mw)\ 2 

< 0. 



This establishes that the simplex T ac := M T 2 (T) is acute, and therefore that S(T) < 
k(V^Mt) since 1 < ||V^|| < a d . 

_ i 

The rest of this appendix is devoted to the proof that \\M T 2 \ \ < S(T ), which implies 

For this we need 



3.6.1 



that k(i/Mt) < adS(T) and thus concludes the proof of Theorem 
the following lemma. 

Lemma 3.8.1. For any acute simplex T ac one has Mj- ac > Id. 

Proof: Without loss of generality we can assume that bary(T ac ) = 0, hence there exists 
A G GLd such that T ac = A(T cq ). The vertices of T ac are q = Aqi for < % < d, and the 
exterior normal to the face of T ac opposite q is 



-1\T 



Qi 
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where /jj > 0. We define for all < i < j < d 

■\ci - Cj\ 2 (xni,xaj) 



Since T ac is acute we have (m^nij) < and therefore \j > 0. We now introduce the 
symmetric matrix 



M:= WvfS where & : = LT3^T ( 3 - 1U ) 

0<i<j<d 



I Q Cj | 



For all < a < b < d we obtain using the relation (rrij, Cj) = —jii{qi, qj) that 
Tar x (Qa,qi- qj}{qb,qi- qj) 



0<i<j<d 



0<i<j<d 



I Q Cj | 



\C a ~ C b 



ab I \ 

= (m a ,m 6 ). 



Therefore mjMmt, = m^m^ for all < a < b < d, which implies that M = Id. Further- 
more for all < a < b < d, we have 

1 = \fab\ 2 = fJbMfab = 22 ^Afabi fij) 2 > Kb- 

0<i<j<d 

It follows that in the sense of symmetric matrices, 

•W, : J2 -lulu > E A^/J = M = Id, 

0<i<j<d 0<i<j<d 

which concludes the proof of this lemma. 



We now conclude the proof of inequality (3.90). Let T be an arbitrary simplex, and let 
ip G GL rf be such that the simplex T ac := if)(T) is acute. Let v «, < i < d be the vertices 
of T and c, = ip(vi) the vertices of T ac . We define the vectors and similarly to (3.88) 



and (3.111) 



Vj - vj Ci - Cj _ ip{eij) 

&ij • I I ; Jij ■- 



\Vi-Vj\ \Ci-Cj\ \ip{e ijy 



for all < i < j < d. For any v G M. d we therefore have 

Tac ^ |^(e i7 -)| 2 

0<i<j<d lrV %3n 
0<«<i<d 

= ||^-1 2 (v^) t m t (^). 
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Using the previous lemma and defining u := ip T v we obtain 

\u\ 2 < uf\v\ 2 < \m 2 v T M Tac v< wn^w 2 u t m tU = (imu-'wmui) 2 , 

hence \\M T 2 \\ < K,(ip). Recalling that ||\/M T || < a d we obtain 

k( v / Mt) = IImJiIHM/1 < a d K(i/>). 
We conclude the proof by taking the infimum among all ip such that ip{T) is acute. 
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4.1 Introduction 

There exists various ways of measuring the smoothness of functions on a domain 
£1 C IR d , generally through the definition of an appropriate smoothness space. Classical 
instances are Sobolev, Holder and Besov spaces. Such spaces are of common use when 
describing the regularity of solutions to partial differential equations. From a numerical 
perspective, they are also useful in order to sharply characterize at which rate a function / 
may be approximated by simpler functions such as Fourier series, finite elements, splines 



or wavelets (see 30 42 45 for surveys on such results). 

Functions arising in concrete applications may have inhomogeneous smoothness pro- 
perties, in the sense that they exhibit area of smoothness separated by localized discon- 
tinuities. Two typical instances are (i) edge in functions representing real images and (ii) 
shock profiles in solutions to non-linear hyperbolic PDE's. The smoothness space that is 
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best taylored to take such features into account is the space BV(fl) of bounded variation 
functions. This space consists of those / in such that V/ is a bounded measure, 

i.e. such that their total variation 

TV(/) = \f\ BV := max j£/div(^) ; <p G V(Q) d , |M| L ~ < l| 

is finite. Functions of bounded variation are allowed to have jump discontinuities along 
hypersurfaces of finite measure. In particular, the characteristic function of a smooth 
subdomain D C Q has finite total variation equal to the d — 1-dimensional Hausdorff 
measure of its boundary : 

\X D \ B v=U d - l {dD). (4.1) 
It is well known that BV is a regularity space for certain hyperbolic conservation laws 



57 68 , in the sense that the total variation of their solutions remains finite for all time 



t > 0. This space also plays an important role in image processing since the seminal 



paper 54 . Here, a small total variation is used as a prior to describe the mathematical 
properties of "plausible images", when trying to restore an unknown image / from an 
observation h = Tf + e where T is a known operator and e a measurement noise of norm 
H e IU 2 — £• The restored image is then defined as the solution to the minimization problem 

mm {\g\ BV ; \\Tg - h\\& < e}. (4.2) 

g£BV 



From the point of view of approximation theory, it was shown in 32 , 33 that the space 
BV is almost characterized by expansions in wavelet bases. For example, in dimension 
d — 2, if / = ^dxipx is an expansion in a tensor-product L 2 -orthonormal wavelet basis, 
one has 

(d x ) ef^feBV^ (d x ) e w£\ 

where wi 1 is the space of weakly summable sequences. The fact that the wavelet coeffi- 
cients of a BV function are weakly summable implies the convergence estimate 

Wf-fNh^CN-VyiBv, (4.3) 

where /jy is the nonlinear approximation of / obtained by retaining the N largest coef- 
ficients in its wavelet expansion. Such approximation results have been further used in 
order to justify the performance of compression or denoising algorithms based on wavelet 



thresholding 31,46,47 



In recent years, it has been observed that the space BV (and more generally classical 
smoothness spaces) do not provide a fully satisfactory description of piecewise smooth 



functions arising in the above mentioned applications. Indeed, formula (4.1) reveals that 
the total variation only takes into account the size of the sets of discontinuities and not 
their geometric smoothness. In image processing, this means that the set of bounded 
variation images does not make the distinction between smooth and non-smooth edges as 
long as they have finite length. 

The fact that edges have some geometric smoothness can be exploited in order to study 
approximation procedures which outperform wavelet thresholding in terms of convergence 
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rates. For instance, it is easy to prove that if / = Xd where D is a bidimensional domain 
with smooth boundary, one can find a sequence of triangulations Tn with N triangles 
such that the convergence estimate 

\\f-l TN f\W<CN-\ (4.4) 

holds, where If denotes the piecewise linear interpolation operator on a triangulation T ■ 
Other methods are based on thresholding a decomposition of the function in bases or 
frames which differ from classical wavelets, see e.g. (3.21. 80 . These methods also yield 



improvements over (4.3) similar to (4.4). The common feature in all these approaches is 



that they achieve anisotropic refinement near the edges. For example, in order to obtain 



the estimate (4.4), the triangulation Tn should include a thin layer of triangles which 
approximates the boundary 3D. These triangles typically have size N~ 2 in the normal 
direction to 3D and iV -1 in the tangential direction, and are therefore highly anisotropic. 

Intuitively, these methods are well adapted to functions which have anisotropic smooth- 
ness properties in the sense that their local variation is significantly stronger in one direc- 
tion. Such properties are not well described by classical smoothness spaces such as BV, 
and a natural question to ask is therefore : 

What type of smoothness properties govern the convergence rate of anisotropic refine- 
ment methods and how can one quantify these properties ? 

The goal of this chapter is to answer this question, by proposing and studying mea- 
sures of smoothness which are suggested by recent results on anisotropic finite element 
approximation (4,27 and Chapter 2. Before going further, let us mention several exis- 



ting approaches which have been developed for describing and quantifying anisotropic 
smoothness, and explain their limitations. 

1. The so-called mixed smoothness classes have been introduced and studied in order 
to describe functions which have a different order of smoothness in each coordinate, 



see e.g. 76,88 . These spaces are therefore not adapted to our present goal since 
the anisotropic smoothness that we want to describe may have preferred directions 
that are not aligned with the coordinate axes and that may vary from one point to 
another (for example an image with a curved edge). 

Anisotropic smoothness spaces with more general and locally varying directions have 
been investigated in (64]. Yet, in such spaces the amount of smoothness in different 
directions at each point is still fixed in advance and therefore again not adapted to 
our goal, since this amount may differ from one function to another (for example 
two images with edges located at different positions). 

A class of functions which is often used to study the convergence properties of 
anisotropic approximation methods is the family of C m — C n cartoon images, i.e. 
functions which are C m smooth on a finite number of subdomains (fii)i=i,... ,& sepa- 
rated by a union of discontinuity curves (^j)j=i,-,i that are C n smooth. The defects 
of this class are revealed when searching for simple expression that quantifies the 
amount of smoothness in this sense. A natural choice is to take the supremum of 
all C m (Qi) norms of / and C n norms of the normal parametrization of Tj. We then 
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observe that this quantity is unstable in the sense that it becomes extremely large 
for blurry images obtained by convolving a cartoon image by a mollifier <ps = j^^i^) 
as 5 — > 0. In addition, this quantity does not control the number of subdomains in 
the partition. 



4. A recent approach proposed in 43 defines anisotropic smoothness through the geo- 
metric smoothness properties of the level sets of the function /. In this approach 
the measure of smoothness is not simple to compute directly from / since it involves 
each of its level sets and a smoothness measure of their local parametrization. 

The results of [4 27 and Chapter [2] describe the L p -error of piecewise linear inter- 



polation by an optimally adapted triangulation of at most N elements, when / is a C 2 
function of two variables. This error is defined as 

Mf) P -= Jrf Jf-Irfhp. 

#(T)<N 

It is shown in [I] for p = oo and in Chapter [2] for all 1 < p < oo that that 

hmsupA^v(/) p <CVL p (/), (4.5) 

N->+oo 



where C is an absolute constant and 



1 1 



Mf)-= W\tet(<Pf)\\\ L r, -:=! + -• (4-6) 

Moreover, this estimate is known to be optimal in the sense that lim inf jv-s.+oo Na^{f) p > 
cAp(f) also holds, under some mild restriction on the class of triangulations in which one 
selects the optimal one. These results are extended in Chapter[2]to the case of higher order 
finite elements and space dimension d > 2, for which one can identify similar measures 
/ !->■ A(f) governing the convergence estimate. Such quantities thus constitute natural 
candidates to measure anisotropic smoothness properties. Note that A p (f) is not a semi- 



norm due to the presence of the determinant in (4.6), and in particular the quasi-triangle 
inequality A p (f + g) < C(A p (f) + A p (g)) does not hold even with C > 1. 

This chapter is organized as follows. We begin in §4.2| by a brief account of the avai- 
lable estimates on anisotropic finite element interpolation, and we recall in particular the 



argument that leads to (4.5) with the quantity A p (f) defined by (4.6). 

Since A p (f) is not a norm, we cannot associate a linear smoothness space to it by a 
standard completion process. We are thus facing a difficulty in extending the definition of 
A p (f) to functions which are not C 2 -smooth and in particular to cartoon images such as 



in item 3 above. Since we know from (4.4) that for such cartoon images the L 2 error of 



adaptive piecewise linear interpolation decays like N 1 , we would expect that the quantity 

A 2 (/) = ||V|det(^/)|||^ /3 , 

corresponding to the case p = 2 can be properly defined for piecewise smooth functions. 



We address this difficulty in §4.3| by a regularization process : if / is a cartoon image we 
introduce its regularized version 

fs-=f* 96, (4.7) 
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where ips = s^(^) is a standard mollifier. Our main result is the following : for any 
cartoon image / of C 2 — C 2 type, the quantity A 2 (fs) remains uniformly bounded as 
(5—7-0 and one has 



limA 2 (/^ /3 



5^0 



<r[ |v/|detW)l| 2/3 + ^)^ ! |[/](^)I 2/3 |^)i 1/3 ^, (4.8) 



where [f](s) and k(s) respectively denote the jump of / and the curvature of Tj at the 
point s, and where C(ip) is a constant that only depends on the choice of the mollifier. 
This constant can be shown to be uniformly bounded by below for the class of radially 
decreasing mollifiers. This result reveals that A 2 (f) is stable under regularization of car- 
toon images (in contrast to the measure of smoothness described in item 3 above). We 
also discuss the behaviour of A p when p ^ 2. 



These results lead us in £4.4 to a comparison between the quantity A 2 (f) and the 



total variation TV(/). We also make some remarks on the existing links between the limit 
and classical results on adaptive approximation of curves, as well as 



expression in Q4.8| ) 

with operators of affine-invariant image processing which also involve the power 1/3 of 
the curvature. 



We devote £4.5 to numerical tests performed on cartoon images that illustrate the 



validity of our results. In addition, we compare the quantity A 2 (f) with the total variation 
TV(/) as a model for plausible images. 

One could be tempted to use the quantity A 2 (f) in place of the total variation TV(/) 
as a prior in image processing, and in particular to replace \g\sv by A 2 (g) in a restoration 



procedure such as (4.2). In £4.6, we show that this optimization program is ill-posed in the 



sense that the restored image would be the noisy image itself. An alternate strategy is to 



use A 2 (f) in the framework of a bayesian least-square estimator, as proposed in 70 in the 
case of the total variation. At the present stage, the algorithm implementing this approach 
did not give satisfactory results due to its very slow convergence. For this reason, we 
only present some numerical illustration of this approach in a simplified one-dimensional 
setting. 



Eventually we describe in §4.7| the extension of our results to finite elements of higher 
degree and higher space dimensions. Concluding remarks and perspectives of our work 



are given in £4.8 



4.2 Anisotropic finite element approximation 

A standard estimate in finite element approximation states that if / G W 2 ' P (Q) then 

Wf-lrJ\\L P <Ch 2 \\d 2 f\\ LPl 

where 7^ is a triangulation of mesh size h := max.TeTh diam(T). If we restrict our attention 
to a family quasi-uniform triangulations, h is linked with the complexity N := #(7/i) 
according to 

dhr 2 <N< C 2 h- 2 
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Therefore, denoting by <j]f l{ (f)LP the LP approximation by quasi-uniform triangulations 
of cardinality N, we can re-express the above estimate as 

vT\f)Lv<CN- l \\d 2 f\\ LP . (4.9) 

In order to explain how this estimate can be improved when using adaptive partitions, 
we first give some heuristic arguments which are based on the assumption that on each 
triangle T the relative variation of d 2 f is small so that it can be considered as constant 
over T, which means that / coincides with a quadratic function on each T. Denoting 
by It the local interpolation operator on a triangle T and by er(/) p : = ||/ — It/||lp(t) 
the local LP error, we thus have according to this heuristics 

11/ - It fh* = e T (f X) ' = (£ ^(Sr)?) 1 
TeT TeT 

We are thus led to study the local interpolation error eT(q) p when q e P 2 is a a quadratic 
polynomial. Denoting by q the homogeneous part of q, we remark that 

eT(q) P = e T (q) P . 

We optimize the shape of T with respect to the quadratic form q by introducing a function 
K p defined on the space of quadratic forms by 

#p(q) : = |T n J i e T (q) P , 

where the infimum is taken among all triangles of area 1. It is easily seen that er(q) p is 
invariant by translation of T and so is therefore the minimizing triangle if it exists. By 
homogeneity, it is also easily seen that 

inf e T {q) p = a-K p {q), - = - + 1, 

|T|— a T p 

and that the minimizing triangle of area a is obtained by rescaling the minimizing triangle 
of area 1 if it exists. Finally, it is easily seen that if tp is an invertible linear transform 

K p (qo V ) = \det(<p)\K p (q), 

and that the minimizing triangle of area |det(</9)| -1 for qo(p is obtained by application of 
Lp^ 1 to the minimizing triangle of area 1 for q if it exists. If det(q) ^ 0, there exists a tp such 
that qotp is either x 2 +y 2 or x 2 — y 2 up to a sign change, and we have |det(q)| = |det((/2)|~ 2 . 
It follows that K p (q) has the simple form 

K p (q)=a|det(q)| 1 / 2 , (4.10) 

where a is a constant equal to K p (x 2 + y 2 ) if det(q) > and to K p (x 2 —y 2 ) if det(q) < 0. 
One easily checks that this equality also holds when det(q) = in which case K p (q) = 0. 
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Assuming that the triangulation T is such that all its triangles T have optimized shape 
in the above sense with respect to the quadratic form q r associated with q T , we thus have 
for any triangle T G T 



er(f) P = e T {qr)p = \T\-K p (q T ) 



d 2 f 



L T {T) 



since we have assumed = qr on T. In order to optimize the trade-off between the global 
error and the complexity N = #(T), we apply the principle of error equidistribution : 
the triangles T have area such that all errors ej^q-zOp are equal i.e. e-r(qT) P = V for some 
7] > independent of T. It follows that 



Nr] T < 



K r . 



d 2 f 



and therefore 



cr N (f) p <\\f-l T f\\Lv <N^ V < 



K r 



d 2 f 



which according to (4.10) implies 



M/)p < CN-'A^f) 



(4.11) 



with A p defined as in (4.6). 

The estimate ( |4.11 ) is too optimistic to be correct : if / is a univariate function then 
A p (f) = while cjv(/) p may not vanish. In a rigorous derivation such as in |2j and Chapter 
2, one observes that if / G C 2 , the replacement of cPf by a constant over T induces an 
error which becomes negligible only when the triangles are sufficiently small, and therefore 
a correct statement is that for any e > there exists N = N (f,e) such that 



a N (f) P < N- 1 



d 2 f 



+ 6 



(4.12) 



L T (Q) 



for all N > JVn, i.e. 



limsup No~N(f)p < 



K r 



d 2 f 



(4.13) 



which according to (4.10) implies (4.5). 



4.3 Piecewise smooth functions and images 

As already observed, the quantities A p (f) are well defined for functions / G C 2 , but 
we expect that they should in some sense also be well defined for functions representing 
C 2 — C 2 "cartoon images" when p < 2. We first give a precise definition of such functions. 
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Definition 4.3.1. A cartoon function on an open set f2 is a function almost everywhere 
of the form 

f — ^ fiXcii, 
i<i<k 

where the f2j are disjoint open sets with piecewise C 2 boundary, no cusps (i.e. satisfying an 
interior and exterior cone condition), and such that fl = U* =1 fij, and where the function 
fi is C 2 on Qi for each 1 < i < k. 

Let us consider a fixed cartoon function / on an open polygonal domain Q (i.e. Q is 
such that fl is a closed polygon) associated with a decomposition (ri i ) 1 < i < fe . We define 



r:= |J dQi, (4.14) 



Ki<k 



the union of the boundaries of the fli. The above definition implies that T is the disjoint 
union of a finite set of points V and a finite number of open curves (Ti)i<i<i- 



■(U 



Ti) U V. 

i<i<i 

Furthermore for all 1 < i < j < I, we may impose that Ti fl Tj C V (this may be ensured 
by a splitting of some of the if necessary). 

We now consider the piecewise linear interpolation 1^ / of / on a triangulation 7jv of 
cardinality N. We distinguish two types of elements of 7jv- A triangle T G T/v is called 
"regular" if T n Y — 0, and we denote the set of such triangles by T^. Other triangles are 
called "edgy" and their set is denoted by 7^. We can thus split fl according to 

fi := (u tgT ^t) u (u Ter ^T) = sr N u fi%. 

We split accordingly the L p interpolation error into 

11/ -Wllk (n) = / l/-Ir,/l p + / i/-ir w /r 

We may use O(N) triangles in 7^ and (for example iV/2 in each set). Since / has 
discontinuities along T, the L°° interpolation error on fl e N does not tend to zero and 7^ 
should be chosen so that fl e N has the aspect of a thin layer around T. Since T is a finite 
union of C 2 curves, we can build this layer of width 0(N~ 2 ) and therefore of global area 
\Q%\ < CN~ 2 , by choosing long and thin triangles in T§. On the other hand, since / is 
uniformly C 2 on fl r N , we may choose all triangles in 7^ of regular shape and diameter 
h T < CN~ 1 / 2 . Hence we obtain the following heuristic error estimate, for a well designed 
anisotropic triangulation : 



ll/-Ir w /l|LP(n) = (ll/-Ir,/III w + ll/-Ir,/III P( ^)) 

< (11/ - W N fWl-^m + ||/ - i Tn f\\ p L ^jn%\) 

< C(N- p + N- 2 ) 1/p , 



1/p 

1/p 
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and therefore 

Wf-lT N f\\Lv m <CN-^2M } (4 . 15) 

where the constant C depends on ||G? 2 /||£oo(n\r), ||/|| 1,00(0) and on the number, length and 
maximal curvature of the C 2 curves which constitute T. 

Observe in particular that the error is dominated by the edge term ||/ — lr N ) 
when p > 2 and by the smooth term ||/ — lj- N f\\LP(n r N ) when p < 2. For the critical value 
p = 2 the two terms have the same order. 

For p < 2, we obtain the approximation rate iV -1 which suggests that approxima- 



tion results such as (4.5) should also apply to cartoon functions and that the quantity 
A p (f) should be finite. We would therefore like to bridge the gap between anisotropic 
approximation of cartoon functions and smooth functions. For this purpose, we first need 
to give a proper meaning to A p (f) when / is a cartoon function. This is not straightfor- 
ward, due to the fact that the product of two distributions has no meaning in general. 
Therefore, we cannot define det(d 2 f) in a distributional sense, when the coefficients of 
d 2 f are distributions without sufficient smoothness. Our approach will rather be based on 
regularisation. This is additionally justified by the fact that sharp curves of discontinuity 
are a mathematical idealisation. In real world applications, such as photography, several 
physical limitations (depth of field, optical blurring) impose a certain level of blur on the 
edges. 

In the following, we consider a fixed radial nonnegative function (p of unit integral and 
supported in the unit ball, and we define for all 6 > and / defined on Q, 

Vt( z ) '■= -pV (?) and fs = f * <Ps- (4.16) 

Our main result gives a meaning to A p (f) based on this regularization. If / is a cartoon 
function on a set Q, and if x G Y \ V, we denote by [f](x) the jump of / at this point. We 
also denote t(x) and n(x) the unit tangent and normal vectors to T at x oriented in such 
way that det(t, n) = +1, and by k(x) the curvature at x which is defined by the relation 

d t (x)t(x) = K(x)n(x). 

For p G [1, oo] and r defined by \ := 1 + -, we introduce the two quantities 



S p (f) := |MdetW)IIUw) = A>(W, 
E P (f) ■■= \\VW\lf}\\LT(T h 



which respectively measure the "smooth part" and the "edge part" of /. We also introduce 
the constant 

C PtV := \\y/\$&\\\ L T (R) , where := / <p(x,y)dy. (4.17) 

Note that fs is only properly defined on the set 

tt 5 := {z G Q ; B(z,6) C Q}, 
and therefore, we define A p (fg) as the L T norm of a/| det(d 2 fs)\ on this set. 
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Theorem 4.3.2. For all cartoon functions f , the quantity A p (f$) behaves as follows : 
- Ifp<2, then 



hmA p (f s ) = S p (f). 

o— >(J 



- If p = 2, then r = | and 

fmA 2 (f s ) = (S 2 (f) r + E 2 (fYC^Y . (4.18) 

- If p > 2, then A p (fs) — > oo according to 

hmJ^A p {f s ) = E p {f)C p>v . (4.19) 

Remark 4.3.3. This theorem reveals that as 5 — > 0, the contribution of the neighbourhood 
of T to A p (fs) is negligible when p < 2 and dominant when p > 2, which was already 
remarked in the heuristic computation leading to (4.15). 

Remark 4.3.4. It seems to be possible to eliminate the "no cusps" condition in the defi- 
nition of cartoon functions, while still retaining the validity of this theorem. It also seems 
possible to take the more natural choice f(z) = ^e~H z H , which is not compactly supported. 
However, both require higher technicality in the proof which we avoid here. 



Before attacking the proof of Theorem 4.3.2 we show below that the constant C p>tp 
involved in the result for p > 2 is uniformly bounded by below for a mild class of mollifiers. 

Proposition 4.3.5. Let if be a radial and positive function supported on the unit ball 
such that J if = 1 and that <p(x) decreases as \x\ increases. For any p > 2 we have 

c > 2f 4 



7T \T + Z 

and this lower bound is optimal. There is no such bound if p < 2, but note that Theorem 



4-3.2 does not involve C p(p for p < 2. 



Proof: Let D be the unit disc of 1R 2 . We define a non smooth mollifier ip and a function 
\1/ as follows 

ip := — — and ^/(x) := / tp(x,y)dy. 

7T ./n 



One easily obtains that ty(x) = |a/1 — x 2 X{-i,i]{x) and ty'(x) = ^^=3 X[-i,i](^), hence 

— Ax 

= —X[-i,i](x). 

For all 5 > we define 1/J5 := d^ipid -1 -), and \£ / «5(x) := f R ips(x,y)dy. Similarly we obtain 

4-r 

Vs(x)V' s (x) = —X hS , 5] S-\ 
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Hence 



C, 



*s(x)\%(x)\ 
2 /255 +1 ^ 



ttS 2 V f + 1 



4 



7i V r + 2 



i _ i 

<5f 2 . 



Note that 



If p > 2 and S E (0, 1] then C Ms > C, 



71 V T + 2 



(4.20) 



The mollifier of interest is radially decreasing, has unit integral and is supported on the 
unit ball. It follows that there exists a Lebesgue measure [i on (0, 1], of mass 1, such that 

(f= ip s dfi(5). 
Jo 

Hence $(a;) := J R (f(x,y)dy = L 1 ^s(x) dfi(5), for any x E M. Since s i— >■ s T is concave on 



when < r < 1, we obtain 



^s(x) dfi{S)) > [ ^ s (x) T du.(5) 



Similarly, since the sign of *&'$(x) is independent of 5, \&(x)\ T = ( £ \%{x)\ dfi{5)J > 
Jq \ty' 5 (x)\ T dp,{8). Applying the Cauchy-Schwartz inequality we obtain 







Eventually we obtain using the previous equation and (4.20) that 

•i 



$|$'| > 







* 5 \%\ dn(5) = / CI, dfi(S) > CI 







which concludes the proof of this lemma. 







The rest of this section is devoted to the proof of Theorem 4.3.2 Since it is rather 
involved, we split its presentation into several main steps. 
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Step 1 : decomposition of A p (fg). Using the notation K(M) := y | det M\, we can 
write 

A p (f s y = I K(d 2 f 5 y. (4.21) 
We decompose this quantity based on a partition of fl 5 into three subsets 

q 5 = n s ur s uv s . 

The first set fig corresponds to the smooth part : 

Clg := &i,s, where Cli >s := {z G Cli ; d(z, \ flj) > 5}. 

l<i<k 

Note that fig is strictly contained in fl s . The second set corresponds to the edge part : we 
first define 

T °s ■= U r ?>- where T l ■= i z E r i ; r \ > 25 ^ 

i<j<i 

and then set 

Tg := I^J Tj )S where Tj )S := {z G fl ; d(z,T) < 5 and vr r (z) G T° js } 
i<j<i 

where ffr(z) denotes the point of T which is the closest to z. The third set corresponds 
the corner part : 

p s :=Q?\(n s \jr s ). 

The measures of the sets Tg and Vs tends to as 5 — > 0, while \flg\ tends to \fl\. More 
precisely, we have 

\T S \ < C5 and \V S \ < C5 2 
where the last estimate exploits the "no cusps" property of the cartoon function. We ana- 



lyze separately the contributions of these three sets to (4.21) 



Step 2 : Contribution of the smooth part fig. The contribution of fig to the in- 



tegral (4.21) is easily measured. Indeed, let us define 

q {z) := J K(d 2 fg(z)Y if z G fig, 
[ otherwise. 

Then we have pointwise convergence Qg(z) — > K(d 2 f(z)) T on fl\T. Since the ^-neighbourhood 
of fig is included in fl \ T, we have 

\\d 2 (f * <Ps)\\l°°(si s ) = \\(d 2 f) * <P6\\l°°(cl s ) < ll^/IU-OAnlly^lUi = \\d 2 f\\ L °°(n\r) 

Since 



K(M) = V| det Af | < \\M\ 



4.3. Piecewise smooth functions and images 
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we have K(d 2 fs) < \\d 2 f\\L°°(n\r) on Q$, and we conclude by dominated convergence that 

lim / K(d 2 f 5 y = lim / Q 5 = [ K(d 2 f)\ 
Jn s Jn\r Jn\r 

Step 3 : Contribution of the corner part Vs- We only need a rough upper estimate 
of the contribution of Vs to the integral (4.21). We observe that 

\\ d2 (f * ¥>*)IU«(n) = 11/ * (dV*)||r«>(n) < ll/lk»(n)||dV*| 
where M := \\f\\L°°(n)\\d 2 ip\\L 1 (M. 2 )- It follows that 

/ p W<ini(^) 



M 
52' 



< C5 



2-2r 



If r < 1, this quantity tends to and is therefore negligible compared to the contribution 
of the smooth part. If r = 1, which corresponds to p = 00, our further analysis shows 
that the contribution of the edge part tends to +00, and therefore the contribution of the 
corner part is always negligible. 

Step 4 : Contribution of the edge part IV This step is the main difficulty of 
the proof. We make a key use of an asymptotic analysis of fs on r,5, which relates its 
second derivatives to the jump [/] and the curvature k as 5 — > 0. We first define for all 
5 > the map 

U s :T\Vx [-1,1] n 

(x,u) 1 — y x + 5un(x). 

We notice that according to our definitions, for 5 small enough, the map Us induces a dif- 
feomorphism between T° x [—1, 1] and Ts, such that n r (Us(x, u)) = x and d(Us(x, u),T) = 
\Us(x, u) — x\ = 8\u\. We establish asymptotic estimates on the second derivatives of fs 
which have the following form : 



d n , n fs(z)--[f](x)&(u 



d ttt f S (z) + -[f]( X )K(x)$(u) 





c 


< 






J 




c 


< 






J 


< 


u(8) 




5 



(4.22) 
(4.23) 
(4.24) 



where lim,5_>o ^(<5) = and with the notation z = Us(x,u). The constant C and the 
function u depend only on /. The proof of these estimates is given in the appendix. As 
an immediate consequence, we obtain an asymptotic estimate of K(d 2 fs) = yj det(d 2 fs)\ 
of the form 

5lK(d 2 f 5 (z)) -y^Y\\[f](x)Wmu)^(u)\\ < 0(5), (4.25) 
where lim^o ^(^) = 0, and the function Q depends only on /. Using the notations 
gs(z):=5lK(d 2 f s (z)), \(x) := y/\^x)~\\[f](x)\, /x(u) := y/\$(u)&(u)\, 
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we thus have 



\g s (z) - \(x)fj,(u)\ < u(5), 



(4.26) 



for all x G T®, u G [—1, 1] and 5 > sufficiently small, with z = U$(x,u). We claim that 



for any continuous functions (g$, A,//) satisfying (4.26), we have for any r > 



Jimr 1 / g 5 



X(x) T dx / fi(u) T du : 
r J-i 



(4.27) 



which is in our case equivalent to the estimate 



limtfa 7 " 1 



K(d 2 f s ) 



|[/]| r N r/2 / |$$'| T / 2 . 



(4.28) 



In order to prove ( 4.27[ ), we may assume without loss of generality that r = 1 up to 
replacing (gg,\,u) by (gj, A T , /i r ). We first express the jacobian matrix of Us using the 
bases B 1 = ((t(x),0), (0, 1)) and B 2 = (t(x),n(x)) for the tangent spaces of T x [—1, 1] 
and Q. This gives the expression 



[dU s (x,u)) Bl ,B 2 



1 — 5un(x) 



and therefore | det([dUs(x, u)]b u b 2 )\ — $ — 5 2 uk{x). Since B\ and B 2 are orthonormal 
bases, this quantity is the jabobian of Us at (x,u), and therefore 

/ 9s = $ gs(x + 5un(x))(l — 5uK(x))dx du 

Jr s ^rgx[-i,i] 



Combining with (4.26), and using dominated convergence we obtain (4.27). 



Step 5 : summation of the different contributions. Summing up the contributions 
of Qs, Vs and I"^, we reach the estimate 



K(d 2 f s y 



K(d 2 fs) T + I K(c?f 5 y+ I K(d 2 fs) T 



r2-2r 



/ K(d 2 fY + e 1 (5))+B(5)5 2 
Jn\r J 

+^ §T (/ \W\A T ' 2 \ \<5><5>'\ T/2 + e 2 (5) 
(S P (fY + + B(S)S 2 - 2 ^ + S^(E p (fyc; ;<p + s 2 (8)), 



where lim^o £ i(^) = h m <5->o £ 2(<^) = and B(5) is uniformly bounded. This concludes the 
proof of Theorem |4.3.2 
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Theorem 4.3.2 allows us to extend the definition of A 2 (f) when / is a cartoon function, 
according to 

A 2 (f) := (S 2 (/) 2 / 3 + E 2 (f)^C 2 2 !^ 3/2 . (4.29) 

We first compare this additive form with the total variation TV(/). If / is a cartoon 
function, its total variation has the additive form 

TV(/):=||V/|| Ll( ^ r) + ||[/]|| il(r) , (4.30) 



Both (4.29) and (4.30) include a "smooth term" and an "edge term". It is interesting to 



compare the edge term of A 2 (f), which is given by 

^(/) = ||VR[/]IUv3 ( r), 

up to the multiplicative constant C 2 ^, with the one of TV(/) which is simply the integral 
of the jump 

J(f) ■= WifWW), 

Both terms are 1-homogeneous with the value of the jump of the function /. In particular, 
if the value of this jump is 1 (for example when / is the characteristic function of a set of 
boundary T), we have 

m) = (f m 1/3 ) 3/2 , (4.3i) 



while «/(/) coincides with the length of T. In summary, A 2 (f) takes into account the 
smoothness of edges, through their curvature k, while TV(/) only takes into account their 
length. 

Let us now investigate more closely the measure of smoothness of edges which is 



incorporated in A 2 (f). According to (4.31), this smoothness is meant in the sense that the 
arc length parametrizations of the curves that constitute T admit second order derivatives 
in Li. In the following, we show that this particular measure of smoothness is naturally 
related to some known results in two different areas : adaptive approximation of curves 
and affine-invariant image processing. 



We first revisit the derivation of the heuristic estimate (4.15) for the error between 
a cartoon function and its linear interpolation on an optimally adapted triangulation. In 
this computation, the contribution of the "edgy triangles" was estimated by the area of 
the layer fl e N according to 

||/-iT W /IUp(n&)< Wfh- \n%\^. 

Then we invoke the fact that T is a finite union of C 2 curves Tj in order to build a layer 
of global area |Ojy-| < CN~ 2 , which results in the case p = 2 into a contribution to the L p 
error of the order 0{N~ 1 ). The area of the layer Q e N is indeed of the same order as the 
area between the edge T and its approximation by a polygonal line with 0(N) segments. 

Each of the curves Tj can be identified to the graph of a C 2 function in a suitable 
orthogonal coordinate system. If 7 is one of these functions, the area between T and its 



178 



Chapter 4. From finite element approximation to image models 



polygonal approximation can thus be locally measured by the L 1 error between the one- 
dimensional function 7 and a piecewise linear approximation of this function. Since 7 is C 2 , 
it is obvious that it can be approximated by a piecewise linear function on O(N) intervals 
with accuracy 0(N~ 2 ) in the L°° norm and therefore in the L 1 norm. However, we may 
ask whether such a rate could be achieved under weaker conditions on the smoothness 
of 7. The answer to this question is a chapter of nonlinear approximation theory which 
identifies the exact conditions for a function 7 to be approximated at a certain rate by 



piecewise polynomial functions on adaptive one- dimensional partitions. We refer to 42 
for a detailed treatment and only state the result which is of interest to us. We say that a 
function 7 defined on a bounded interval / belongs to the approximation space A S (L P ) if 
and only if there exists a sequence (pn)n>o of functions where each p^ is piecewise affine 
on a partition of / by N intervals such that 

Wi-PnWlp < CN- S . 

For < s < 2, it is known that 7 G A S (L P ) provided that 7 G B S TT (I) with £ := - + s, 
where B!f. T (I) is the standard Besov space that roughly describes those functions having 
s derivatives in L T . In the case s = 2 and p — 1 which is of interest to us, we find r = | 
and therefore 7 should belong to the Besov space B\ Note that in our definition 

3'3 

of cartoon functions, we assume much more than B\ 1 smoothness on 7, and it is not 



clear to us if Theorem (4.3.2) can be derived under this minimal smoothness assumption. 



However it is striking to see that the quantity E 2 (f) that is revealed by Theorem (4.3.2) 
precisely measures the second derivative of the arc- length parametrization of T in the La 
norm, up to the multiplicative weight |[/]| 2 ^ 3 - Let us also mention that Besov spaces have 



been used in 43 in order to describe the smoothness of functions through the regularity 
of their level sets. Note that edges and level sets are two distinct concepts, which coincide 
in the case of piecewise constant cartoon functions. 

The quantity In] 1 / 3 is also encountered in mathematical image processing, for the 
design of simple smoothing semi-groups that respect affine invariance with respect to the 
image. Since these semi-groups should also have the property of contrast invariance, they 
can be defined through curve evolution operators acting on the level sets of the image. The 
simplest curve evolution operator that respects affine invariance is given by the equation 

dT , 1/3 



dt 



k\ ' n, 



where n is the outer normal, see e.g. |22|. Here the value 1/3 of the exponent plays a critical 
role. The fact that we also find it in E 2 (f) suggests that some affine invariance property 
also holds for this quantity as well as for ^(Z). We first notice that if / is a compactly 
supported C 2 function of two variables and T is a bijective affine transformation, then 
with / such that 

/ = />, 

we have the property 

d 2 f(z) = L T d 2 f(Tz)L, 
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where L is the linear part of T and L T its transpose, so that 

y/\det(d?f{z))\ = \detL\y/\ det(d 2 /(T*))|. 

By change of variable, we thus find that 

A p (f) = \detL\ 1 ' T - 1 A p {f) = \detL\^A p (f). (4.32) 

A similar invariance property can be derived on the interpolation error <7jv(/)p = ||/ — 
W N /IU p where T/v is a triangulation which is optimally adapted to / in the sense of mini- 
mizing the linear interpolation error in the LP norm among all triangulations of cardinality 
N. We indeed remark that an optimal triangulation for / is then given by applying T 
to all elements of Tn- For such a triangulation Tn '■= T(Tn), one has the commutation 
formula 

% / = (If, /) ° T > 
and therefore we obtain by a change of variable that 

Mi% = II/- If, fh P = I det^H/ - I Tn f\\ LP = I det L\^a N (f) p . (4.33) 

Let us finally show that if / is a cartoon function, then E 2 (f) satisfies a similar invariance 
property corresponding to p — 2, namely 

E 2 (f) = |det L\ l ' 2 E 2 (f). (4.34) 

Note that this cannot be derived by arguing that A 2 (f) satisfies this invariance property 
when / and / are smooth, since we lose the affine invariance property as we introduce the 
convolution by <ps '■ we do not have 

(/ o T) * (p s = (/ * <A5) o T. 

unless T is a rotation or a translation. 

Let Tj be one of the C 2 pieces of T and 7j : [0, Bj\ — > Q a regular parametrisation of 
Tj. The curvature of T on Tj at the point 7j(i) is therefore given by 

Since / = / o T, the discontinuity curves of / are the images of those of / by T : 

The curvature of Tj at the point T(^j(t)) is therefore given by 

«(T(7,(t)))- ||i7 , (()||3 -det(L) ||£t , (()||3 . 

This leads us to the relation : 

Idet^l^l^.^))!^ ||v.( t )|| = |«(T( 7 ,(0))r /3 ||^7'(0ll, (4-36) 



180 



Chapter 4. From finite element approximation to image models 



and therefore 







detLl 1 / 3 / |[/](7i(i))| 2/3 |«(7i(i))| 1/3 lbj(*)ll#, 
Jo 

detLl 1 / 3 [ |ffl| 2/3 k| 1/3 . 



r, 



Summing over all j = 1, • • • ,1 and elevating to the 3/2 power we obtain (4.34). 



4.5 Numerical tests 

We first validate our previous results by numerical tests applied to a simple cartoon 
image : the Logan-Shepp phantom. We use a 256 x 256 pixel version of this image, with a 
slight modification which is motivated further. This image is iteratively smoothed by the 
numerical scheme 



u n+l _ + U i+l,j + U i-l,j + V %j+l + U tj-l 

2 



This scheme is an explicit discretization of the heat equation. Formally, as n grows, u n is 
a discretization of 

1 _IMI 2 

u * Vx^r with <p s (z) := — e I 3 ", (4.38) 

where u stands for the continuous image. The determinant of the hessian is discretised by 
the following 9-points formula 

<j ■= K i+ i-2^ i + < i _J(< +lii -2< i + <_ 1)i ) 

n , n _ n _ n \2 (4.39) 

"i-lj'-l "i+lj'-l "i-lj+lJ v y 

16 



For each value of n, we then compute the £ T norm of the array (y^MljT) f° r r £ [§> 1]> 

which corresponds top G [1, oo] with i := 1 + -. This norm is thus a discretization of the 
quantity 



For each value of n we obtain a function r G [|, 1] — > D n {r) e M + . 

As n grows, three consecutive but potentially overlapping phases appear in the beha- 
viour of the functions D n , which are illustrated on Figure |4~2| 



1. For small n, the 9-points discretisation is not a good approximation of the determi- 
nant of the hessian due to the fact that the pixel discretization is too coarse compared 
to the smoothing width. During this phase, the functions D n decay rapidly for all 
values of r. 
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Figure 4.1 - The Logan-Shepp image (left), modified image (center), smoothed image 
(right). 
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Figure 4.2 - The curves D n (r) for n < 50 (left), 50 < n < 100 (center) and 100 < n < 
200 (right). 



2. For some range of n, the edges have been smoothed by the action of (4.37), but the 



parameter \yjn in (4.38) remains rather small. Our previous analysis applies and we 
observe that D n (2/3) is (approximately) constant while D n (r) increases for r < 2/3 
and decreases for r > 2/3. 

3. For large n, the details of the picture fade and begin to disappear. The picture begins 
to resemble a constant picture. Therefore the functions D n decay for all values of r, 
and eventually tend to 0. 

In our numerical experiments, we used the well known Shepp-Logan Phantom with 



a slight modification as shown on Figure |4.1| : we have removed the thin layer around 
the head, which represents the skull, because it disappears too quickly by the smoothing 
procedure and causes phases 1 and 3 to overlap, masking phase 2. For more complica- 
ted functions /, such as most photographic pictures, phases 1 and 3 also tend to overlap 
for similar reasons. Indeed, these pictures often have details at the pixel scale, including 
electronic noise due to the captor. Since these details disappear early phase 3 begins im- 
mediately, therefore phase 2 cannot be observed. 
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Our next discussion aims at comparing the quantities A 2 (f) with the total variation 
TV(/) on numerical images. In particular, we want to compare how these quantities 
measure the geometric complexity of images. Here, we consider non-discretized images 

z = (x,y)e [0,l] 2 ^/(z), 

and we compare the numerical behaviour of A 2 (f) and TV(/) for some relevant cases. 
Let us recall that for any g G C 2 (1R + ), if / is the radial function 

f(z)=9(\z\), 

one has 

det (d 2 f(z)) = ±-g>(\ Z \)g»(\z\). 

As a result we find that the two functionals TV and A 2 behave similar on the oscillatory 
images fw(z) := cos(a;|,z|) illustrated on Figure [43 (left), as oj — > +oo : 

TV(/ w )~u; and A 2 (f u ) ~ oj 3/2 . (4.40) 

We next consider the cartoon images defined by g u (z) = , which have the form of a 



circular staircase with approximately oj steps of height 1/oj, as illustrated on Figure 4.3 



(center). In this case, we may give a meaning to A 2 (g w ) based on (4.29), which since g u 
is piecewise constant gives 



A 2 {g^ = C Jjg^\4., 



where V are the circular curves of discontinuities. From this we find that as oj — > +oo, 

TV(g u )~l and A 2 (g u ) ^ V^- (4-41) 

This shows that in contrast to TV, the functional A 2 penalizes images which have the 
appearance of a staircase. 

Finally we consider the cartoons functions h u (z) = Xs^ where the set S w is defined by 

f 7T 1 

S w := <z = (rcos8,r sin 9) ; < 9 < — , and < r < 1 H — cos{u9) 

2 oj 



see Figure 4.3 (right). These functions have therefore only one step but its geometry 
becomes more and more oscillatory as u grows. More precisely its length remains finite, 
but its curvature behaves like oj. In turn, we obtain that as oj — > +oo, 



TV(h u )~l and A 2 (K)~^j. (4.42) 

This shows that in contrast to TV, the functional A 2 penalizes the fact that the curve of 
discontinuity dS^ strongly oscillates. 

These examples suggest that the functional A 2 gives a more relevant account of the 
complexity of cartoon images than the total variation TV. In particular staircasing effects 
and oscillatory curves of discontinuity are penalized. 



4.6. Applications to image restoration 
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4.6 Applications to image restoration 

The previous discussion suggests that we could use A 2 as an alternative to TV as a 
prior in image restoration. In particular, we may try to replace \q\bv by A 2 (g) in (4.2), 
and therefore consider the minimization problem 

Tom{A 2 {g) ; \\Tg - h\\ L 2 < e}, 

9 

or its formulation using a Lagrange multiplier 

min\\Tg-h\\l 2 +tA 2 (g) 2 /\ (4.43) 

9 



Our first observation is that, in contrast to (4.2), these problems are non-convex. 

Moreover, it is easily seen that even in the very simple case of image denoising corres- 
ponding to T = Id, the above problems are ill-posed, in the following sense. 

Proposition 4.6.1. For any f G L 2 ([0, 1] 2 ) ; there exists a sequence (f n ) n >o of C°° func- 
tions such that 

Wf-fnWv^O and A 2 {f n ) ->0. 

Consequently the infimums of A 2 (g) over those g G C 00 such that \\g — h\\ L 2 < e and of 
\\g~h\\ 2 L 2 + tA 2 (g) 2 ^ 3 over all g G C°° are both equal to 0, and are not attained in general. 

Proof: If T is a triangulation of the domain [0, l] 2 , and if g is piecewise affine on T, 
we may consider its regularized version gs '■= g * <fis- From Theorem 4.3.2, we find that 
A 2 (gs) —7- as 5 — >• 0. The proof follows by observing that piecewise affine functions on 
triangulations are dense in L 2 . o 



Note that the above result is formulated in the setting of non-discretized images, and 
does not exactly hold for numerical images if we define A 2 using the 9-points formula 
(4.39) for the discretization of the determinant of the hessian. However, we expect that 
the ill-posedness in the continuous setting is reflected by a bad behaviour of the solution 
of the discrete optimization problem. 



184 



Chapter 4. From finite element approximation to image models 



In summary, a straightforward generalization of the minimization approach (4.2) with 
A 2 in place of TV is doomed to fail. In order to circumvent this difficulty, we propose 
a different strategy based on a restoration algorithm introduced in 
algorithm is based on a bayesian framework that we briefely recall. 



70 and 71 . This 



4.6.1 A bayesian approach to image denoising 

For simplicity, we focus from this point on the discrete setting, in which images are 
discretized on a N x N grid where N > 1. We denote by 



!>NxN 



the collection of all discrete images. Bayesian restauration algorithms are based on a prior 
probability distribution on J 7 describing how certain images are more "plausible" than 
others. One way to build such priors is through a functional 

J : 7 -> R + 

which reflects the plausibility of an image : typically the value of J is large for complex 
images and zero only for few very simple images. The functional J is typically obtained 
as the discretization of a measure of smoothness, such as the total variation TV or t he 
quantity A^ 3 (discretized with the nine points formula (4.39)) which is shown in § |4.5| to 
give a good account of the complexity of cartoon images. 
Assuming that the quantity 



j j 



exp(-J(F))dF 



is finite, our bayesian prior for plausible images is the probability distribution 

e -J(F) 



-dF (4.44) 

where dF stands for the Lebesgue measure on T . 

We use the simplistic, although popular, model of additive gaussian noise in which 
random drafts from J 7 with respect to the probability 



g— a||n|| 



-dfl, (4.45) 



where a > is a parameter and Z a is a normalizing factor, represent typical noise. 
We regard a corrupted image G as a random variable defined as the sum 

G = F + tt 



of a random variable F of distribution (4.44), the original image, and a random variable 
Q of distribution (4.45), the corruption by additive gaussian noise. 
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By construction the conditional probability density p(G\F) of the corrupted image G 
with respect to the original image F is 

-a||n|| 2 -a\\G-F\\ 2 

P(G\F) = e -— = — . 

Zj a Zj a 

The conditional probability density p(F\G) of the original image F with respect to the 
corrupted one G is obtained by the bayesian rule of conditional probabilities 

p(F\G)p(G)=p(G\F)p(F). 

Replacing p(G\F) and p(F) with their explicit expressions we obtain 

P(F\G) = -L-p(G\F)p(F) 

1 17-11 2 \ / 1 



p(G) \Z, 



_ e -a\\G-FU — e ~J{F) 



1 -o.\\G-F\\l-J{F) 



p(G)Z a Zj 

We define 

a(F, G) = exp(-a\\G - F\\ 2 2 - J(F)). 

For any fixed G, the explicit function F G F <r(F, G) G is therefore proportional 
to the conditional probability density F i— > p(F\G). 

In order to recover the original image F from the corrupted one G a first approach, cal- 
led the maximum a posteriori (MAP), consists in maximizing the conditional probability 
density 

Fj := argmax p(F\G) 

= argmax a(F,G) 
fgf 

= argmin a\\F - G\\ 2 + J(F) 

F&F 



We thus recover the optimization procedure (4.43) in the c ase T = Id of image denoising, 
and of the functionnal J = A^ 3 . As observed in Proposition 4.6.1 this approach is doomed 



9 / Q 

to fail for the functional A 2 . For the total variation functional J = TV, the MAP 
approach gives good results, but is also known to produce visual artifacts : the restored 
image is exactly constant on large regions, delimited by sharp discontinuities which do 
not correspond to a feature of the original image. The heuristical reason of this problem 
is that the maximum of a probability density is generally not a good representative of a 



random draft for this probability, as discussed in 70 . 

A second approach, called the minimum mean square error (MMSE), consists in finding 
F which minimizes the empirical quadratic risk \\F — F'\\ 2 with respect to a random image 
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F' distributed according to the conditional probability density p(F'\G) : 

Fj := argmin / \\F - F'\\ 2 p(F'\G) dF' , 

= argmin I \\F - F'f o(F' ,G) dF' , 
Far Jt 

= Fv(F,G)dF. 



where Z — Jj. cr(F, G) dF is a normalizing factor. This method is studied in depth in 70 
in the case J = TV where it is shown that it has the advantage of suppressing the visua 
artifacts observed in the MAP approach, and we describe in the next section its adaptation 
to the case J = A 2 / 3 . 



4.6.2 The restauration algorithm 

The computation of the estimator Fj is a numerical challenge, since it involves an 
integration on the space F = M> NxN which has dimension at least 512 x 512 ~ 250000 
in realistic cases. We use an algorithm which was first proposed in the thesis 



70 , and 



which is a variation on the Metropolis Hastings algorithm. We shall only sketch its main 
principles and we refer to the original thesis for more details. 

The central point is to define a Markov chain (Fk)k>o of images in F which is recurrent 
with respect to the probability measure <r(F, G)dF/Z, where G is the denoised image and 
Z is a normalizing factor. Then, almost surely, 



lim , , 

K^-oo K 



0<fc<_ftT-l 



1 -^Fa(F,G)dF. 



(4.46) 



Two successive images Fk and Fk+i generated by the algorithm proposed in 70 only differ 



by the value of a single pixel. The cost of a step of the algorithm, generating a new image 
Fk, comes mainly from the computation of the ratio 



a(U, G) 



exp(|| V - Gf - \\U - G\\ 2 + J{V) - J{U)\, 



(4.47) 



where U and V are two elements of J 7 which differ at a single position 

This algorithm applies to any continuous functional J G C ,0 (J r , M + ), hence in particular 
to non- convex junctionals such as J = A^ 3 . For numerical applications, in order to have 



a good speed of convergence, the ratio (4.47) needs to be extremely cheap to compute in 



terms of computer time. The discretisation of the total variation TV or of the functional 
A^ 3 , are given for an image F by respectively 



N ^ 



+ 



and N' 



-2/3 



/^\di,j\ 



'■j 



h3 



where dj j is given by the formula (4.39). The ratio (4.47) is cheap to compute numerically, 



as required by the algorithm, since it has an explicit algebraic expression which only 
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involves the the values of the images U,V G J 7 at the positions (i + k,j + I) where 
max{|&|, |/|} < 2, where denotes the single position at which these images differ. 

From a theoretical point of view, the averaging procedure (4.46) converges at the 
speed 0{K~v). Unfortunately the constant in front of the convergence rate plays a key 
role in practice, and we did not manage to "reach the convergence" for realistic 512 x 512 
images with the functional J = A^ 3 . This issue might be solved by a parallelization of 
the algorithm, or a modification of the algorithm in which a group of neighboring pixels 
is modified at each step instead of a single pixel. For the time being, we thus only present 
one-dimensional results. 



4.6.3 Numerical illustration in the ID case 

We present numerical results in one dimension only, using some counterparts of the 
va 

/ G C 2 ([0,1]) the quantity 



2/3 

total variation TV and of the functional A 2 in that context. We define for each function 



D(f) := / VWl- (4-48) 
J[o,i] 

The next proposition shows that D(f) can be given a meaning when / has localized 

2 /3 

discontinuites, similarly to the functional A 2 for cartoon images. 

Proposition 4.6.2. Let f : [0, 1] — > R be piecewise C 2 , with a finite set E of discontinuity 
points in ]0,1[. Let (f be a C 2 mollifier supported in [—1,1], satisfying J R ip = 1 and 
<p(x) = ip(—x) for all x G R. For all 5 > let <p$ '■= fv 9 ^) an d ^ fs '■= f * fs- Then 
fs G C 2 ([5, 1 — 5}) and satisfies 

Jim / J\f[\ = [ + Citp) V\UM (4-49) 

S ^°J[S,l-8] V J]0,1[\E ~ 

where C{ip) := / R 

Proof: We denote by < c\ < ci < ■ ■ ■ < e n _i < 1 the points of E and we define eo = 
and e n = 1. The function / can written as the sum 

f = J + A + S, 

where J (the Jump part) is piecewise constant, and A is continuous and piecewise Affine, 
with respect to the partition ( ]e«, ej + i[ ) < i<n _ 1 of the interval [0, 1]. The function S is C 1 
on ]0, 1[, and S" is uniformly bounded. Then, on the interval [S, 1 — 5], 

fs = f * <pe = {J + A + S) * ips = Js + A s + S s . 
We now assume that the parameter 5 satisfies 



< 5 < min 



e i+l ~ e i 



0<i<n-l 2 

For any < i < n — 1 one has on the interval [e, + 5, Ci + \ — 5] 

Js = J, As = A and Ss = fs — A — J. 
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TV«D 



TV*D ^~ 
Figure 4.4 - Behavior of the functionals TV and D on different types of functions. 

Hence J'J = Ag = 0, and S' s ' = f'J converges uniformly to /" as 5 — > 0, on this interval. 
Therefore 

i im n E P +1 = [ vln (4.50) 

Furthermore for any 1 < % < n — 1 and any \x\ < 1 one easily checks that 

J' s '( ei + Sx) = ^[Mei) <p\x) and A'fa + 5x) = ^[f}(e,) <p(x) 

where 



[f]( ei ) := lim f( ei + e) - /(e, - e) and [/' ](e,) := lim /'(e, + e) - /'(e, - e). 

On the other hand Sg (e^ + <5x) is uniformly bounded independently of x and 5. Hence the 
contribution of J'J is dominant on the intervals [e$ — 5, + 5], and we obtain 



KKn-r e » _l! KKn-1 



(4.51) 



Combining (4.50) and (4.51) we conclude the proof of this proposition. 



For the simple oscillating function f u (x) := cos(a;x), illustrated on Figure 4.4 (left), 
the two functionals TV and D behave similarly as u> — » oo : 



TV(/ w )~w and D{f u 



ijj. 



(4.52) 



On the contrary for the function g w (x) := illustrated on Figure 4.4 (right) we obtain 

as cu — > oo 

TV(^)~1 and D{g u )~Vti, (4.53) 



where D(g UJ ) is understood in the sense of ( 4.49[ ). Hence the functionals TV and D treat 
discontinuities very differently, and the latter penalizes functions which have the appea- 
rance of a staircase. 
We denote by 

7 :-- 
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the collection of discretized one-dimensional functions. For any F G F, we denote by 
TV(F) and D(F) the discretization of the functionals TV and D using finite differences, 
as follows 

TV(F): = Yl l^-^+il- (4-54) 

l<i<N-l 

and 

D(F):= Yl V\F i+ i-2F i + F i _ 1 \, (4.55) 

2<i<N-l 

We now turn to numerical results. Given a one dimensional discretized function F G 
F := M. N we compute using the algorithm described in £ 4.6.2 three different regularizations 
ofF. 

F TV := argmin ai||F-F / || 2 + TV(F') 

F'm N 

F TV == 4- / exp(-Q! 2 ||F - F'f - f3 2 TV{F'))dF' 

^2 Jrn 

F D := ±- [ F'ex P (-a 3 \\F-F'\\ 2 -{3 3 D(F'))dF', 



where Z 2 and Z 3 are normalizing coefficients. The parameters otj, a2, «3 and /32, ^3 can be 
freely chosen. 

We have not attempted to denoise some data and to compare the PSNR of the res- 
toration by the different methods. Indeed such a comparison should, in order to be fair, 
involve an in depth analysis of the role of the parameters ai, 02,0(3 and /?2,/^3, that we 
did not have the time to do. Note also that F TV involves a single parameter, while two 
parameters are needed for Ftv and Fjj. We refer to the thesis |70] for the comparison of 
FJv and Fpv from the point of view of the PSNR. 

Instead we regard the three procedures as regularization methods, that we apply to 
different test functions. We focus on the qualitative differences of the regularizations F^ v , 
F T v an d F D of different functions F, and we discuss on how these differences traduce the 
different bayesian priors implicit in the three methods. 



Our first experiment, see Figure 4J), is the regularization of a random walk. As remar 
ked in 



70 



FJv i s constant over large intervals and also has several sharp discontinuities. 
These two types of features were not present in the original function F and are undesi- 
rable. They are avoided in the two other regularizations Fpv and Fd- 



Our second experiment, see Figure 4.6 is the regularization of a Heaviside function. 
Up to numerical artifacts, the functions F and F^ v are identical, which is precisely what 
is wanted in this situation. The discontinuity disappears in Ftv an d Fd, and is replaced 
with a smooth but sharp and well localized transition. Numerical experiments on real 
(bidimensional) images in 70 show that generally, and with appropriate parameters, the 
regularization Ftv of an image F does not cause a perceptible blurring of the edges which 
appear in the original image. 
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Figure 4.6 - Heaviside function. (Top, Left) Original F, (Top, Right) F^ y , (Bottom, Left) 
F TV , (Bottom, Right) F D . 
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This example puts in light a strong flaw of the regularization method Fp : it does not 
obey the maximum principle. Indeed, it is clear on Figure 4.6 that and -Ftv take their 
values in the interval [min F, max F] , but Fp does not. 



Our third and last experiment, see Figure 4.7 is the regularization of a "Churp" : an 
oscillating function of increasing frequency, here sin(x 3 ) where x G [0,4]. Qualitatively, 
most regularization algorithms regard slow oscillations as valuable data which should be 
preserved, and fast oscillations as noise which should be eliminated. The regularization of 
a churp allows to analyse the behavior of a regularization algorithm in the intermediate 
regimes. 



From a qualitative point of view, the MAP regularization F£ v again introduces un- 
desirable visual artefacts : F^ v is exactly constant on some regions, and VF^ V has sharp 
discontinuities which do not correspond to a feature of the original function. In contrast 
the MMSE regularizations F^y and Fp do not exhibit these artefacts. As anticipated, the 
slow oscillations of the original function F are preserved, while the fast oscillations are 
attenuated. It seems qualitatively that fast oscillations are more strongly attenuated in 
Fd than in -Fry- The author could not prove this last property, but a heuristical analysis 
suggests that sinuosidal oscillations of large frequency uj ^> 1 are attenuated by a factor 
ui~ 2 in Ftv and u~ A in F^. 
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4.7 Extension to higher dimensions and higher order 
elements 

The results on approximation by anisotropic bidimensional piecewise linear finite ele- 
ments that we have exposed in §4.2 have been generalized in Chapter [2] to the case of 



elements of arbitrary order m—1 defined on partitions of f2 C H d by simplices. Here, the 
local error is defined as 

e m ,r(/) P := 11/ - I™ 1 /IU P (T), 

where I™ -1 denotes the local interpolation operator on P m _i for a <i-dimensional simplex 
T. This operator is defined by the condition 

for all points 7 G T with barycentric coordinates in the set {0, — r, — ^r, • • • ,1}. We 
denote by H m C P m the collection of homogeneous polynomials of degree m, and we 
define for any q G H m the quantity 

^m,p(q) := M e miT (q) p . 
We refer to K m>p as the shape function. For piecewise linear elements in dimension two, 



i.e. m = d = 2, we have observed that K p = i^ 2 , P has the special form given by (4.10) 
which justifies the introduction of the quantity A p (f). In a similar way, it can easily be 
proved that for piecewise linear elements in higher dimension, i.e. m = 2 and d > 2, one 
has 

ci|det(q)| 1 / d < K 2 , p (q) < c 2 |det(q)| 1 / d . 

For piecewise quadratic elements in dimension two, i.e. m = 3 and d — 2, it is proved in 
Chapter [2] that 

Ci|disc(q)| 1 / 4 <K 3 , P (q)<c 2 |disc(q)| 1 / 4 . 
for any homogeneous polynomial q G H 3 , where 

disc(ax 3 + bx 2 y + cxy 2 + dy 3 ) := b 2 c 2 - 4ac 3 - 4b 3 d + 18abcd - 27a 2 d 2 . 

For other values of m and d, equivalent expressions of K m ^ p (q) in terms of polynomials in 
the coefficients of q are available but of less simple form, see Chapter 2. 

Defining the finite element interpolation error by an optimally adapted partition 



a N (f)p--= mf 11/ -It /II 



#{T)<N 



LP, 



where CP is the global interpolation operator for the simplicial partition (possibly non 



conforming) T, the following generalization of (4.13) is proved in Chapter^: 



\imsup N d a N (f) p < C d 



K < dmf 



, ^ = - + 5. (4.56) 

L T (Q) t p a 
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The constant Cd is equal to 1 when d = 2 but larger than 1 when d > 2 due to the 
impossibility of exactly tiling the space with locally optimized simplices. If / is a C m 
function of d variables, it is therefore natural to consider the quantity 

A m>p (f):=\\K m , p (d m f)\\ L r {Q) , - = - + 7> ( 4 - 57 ) 

as a possible way to measuring anisotropic smoothness. For d = 2 and piecewise linear 



elements, we have seen in £4.2 that A 2}P (f) is equivalent to the quantity A p (f). 

Similarly to A p we are interested in the possible extension of A m)P to cartoon functions. 
We first introduce a generalisation of the notion of cartoon functions to higher piecewise 
smoothness m and dimension d. 

Definition 4.7.1. Let m > 2 and d > 2 be two integers. Let Q C M d be an open set. We 
say that a function f defined on Q is a C m cartoon function if it is almost everywhere of 
the form 



f= 2^ m 



l<i<k 

where the are disjoint open sets with piecewise C 2 boundary, no cusps (i.e. satisfying 
an interior and exterior cone condition) , and such that Q = U* =1 f)j. Additionally, for each 
1 < i < k, the function fi is assumed to be C m on Qi. 

Let us consider a fixed cartoon function / on a polyhedral domain Q C M d (i.e. Q is 
such that Q is a closed polyhedron), and a decomposition (f2j)i<i<fc of Q as in definition 
4.7.1 As before we define T := Ui <i<k dQi, the union of the boundaries of the fij. Our 
assumptions on the sets (fij)i<i<fc imply that T is the union of a finite number of open 
hypersurfaces (Xj)i<j<i, and of a set V of dimension d — 2. 

As in §4.3[ we now consider a sequence f^ of piecewise linear approximations of / on 
simplicial partitions Tn of cardinality N. We distinguish two types of elements of 7~n- A 
simplex T e Tn is called "regular" if T D T = 0, and we denote the set of these simplices 
by Tn- Other simplices are called "edgy" and their set is denoted by T§. We can again 
split Q according to 

n ■.= (u TeT rT) u (u Ter ^T) = n r N u n%. 

Heuristically, if the partitions Tn are built with approximation error minimisation in mind, 
the number of elements should be balanced between T N and T N . The partition T^ tends to 
cover most of the surface of Q, with simplices of diameter < CN~d, and L°° approximation 
error |/ — /jy| < CN~^ (since we use P m _i elements). On the other hand, since / has 
discontinuities along T, the L°° approximation error on T N does not tend to zero, and 
T N should thus be chosen so as to produce a thin layer around T. Let h be the typical 
diameter of an element of T N . Since the Tj has bounded curvature, this layer can be made 
of width 0(h 2 ) and therefore the layer around T has volume bounded by h 2, Hd-i(X) U P to 
a fixed multiplicative constant, where "Hd_i(r) is the d— \ dimensional Hausdorff measure 
of T. On the other hand the minimal number of such elements of diameters h needed to 
cover T is bounded by h^Ud^iT) up to a fixed multiplicative constant. Eventually, we 

_ 2 

find that the layer around Y has volume bounded by CN d - J . 
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Hence we have the following heuristic error estimate, for a well designed anisotropic 
partition : 

11/ — /iv||i4>(n) < 11/ — fN\\LP(n r N ) + 11/ - /ivlUp(n^) 

< 11/ - || I I* + 11/ - /jv|U-(^)l^l^ 

This leads us to define a critical exponent 

Pc = Pc{m,d) :-- 



m(d — 1) 



If one measures the error in L p norm with p > p c (m,d), then the contribution of the 
edge neighbourhood fl e N dominates, while if p < p c (m,d) it is negligible compared to 
the contribution of the smooth region Q r N . For the critical exponent p = p c (m, d) the two 
terms have the same order, which makes the situation more interesting. Note in particular 
that p c (2, 2) = 2, which is consistent with our previous analysis. 

For p < p c (m,d), we obtain the approximation rate N~ m ^ d which suggests that ap- 



proximation results such as (4.56) should also apply to cartoon functions and that the 



quantity A mjP (f) should be finite for such functions. We again need to use a regularization 



approach, for the same reasons as in §4.3| For a given dimension d, we consider a radial 
nonnegative function <p of unit integral and supported in the unit ball of lR d , and we define 
for 5 > 

¥s( z ) : = m^It) and fs = f*¥s- (4.58) 



5, 

In order to define the quantities of involved in our conjecture, we need to introduce 
the second fundamental form of an hypersurface. At any point x G T \ V we denote by 
n(x) the unit normal to V. Note that since V is piecewise C 2 , the map x 1— > n(x) is C 1 
on r \ V. We define T X T := n(a;)" L , the tangent space to T at x. In a neighbourhood of 
x G T \ V, the hypersurface T admits a parametrization of the form 

u G T X T ^ x + u + X(u)n(x) ET\V, 

where A is a scalar valued C 2 function. By definition, the second fundamental form of T at 
the point x is the quadratic form ll x associated to d 2 X(0) which is defined on T X T x T X T. 
Alternatively, for all u,v G T X T we have Jl x (u,v) := — (d u n, v). The Gauss curvature n(x) 
is the determinant of H^, in any orthonormal basis of T X T, 

k(x) := det H^.. 

For example, in two space dimensions the tangent space T X T is one dimensional, and we 
simply have H x (u, v) = k(x)(u, v). We also denote by cr(x) G {0, • • • , d— 1} the signature 
of the quadratic form II X , which is defined as the number of its positive eigenvalues. 
With t such that - := >n } + -, we define 

t d p' 

S p (f) := \\K(d m f)\\ L . (n \ r) = A p (f ln \ r ). 
We conjecture the following generalization to Theorem |4.3.2| 
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Conjecture 4.7.2. There exists d positive constants C{k), k G {0, • • • , d— 1} ; that depend 
on ip,p,m,d, such that, with 



E p (f):= ||CV)M%]lk(r) 



\C{a(x))\K(x)\%[f{x)}\ T dxY 



we have 

- If p < p c then 



\imA m:P (f s ) = S p (f). 

o— >0 



- If p = p c then 

- If p > p c then 

In the remainder of this section, we give some arguments that justify this conjecture. 



hm(A m ^f s )) = (S p (fy + E p (fY) 1/T 

0— ¥i) 



lim S pc vA m:P (f 5 ) = E p (f). 

5— >0 



Given a cartoon function /, we define the sets Q$, T®, T$ and Vs similarly to £4.3 We 
need to perform an asymptotic analysis of the integral 



/ K(d m f 5 ) T = [ K(d m fs) T + [ K(d m f 5 ) T + [ K(d m f s ) T . 
Jn Jq s JVs Jrs 



(4.59) 



As in the proof of Theorem 4.3.2 the contribution of Vs can be proved to be negligible 
compared with those of fl$ and as 5 — > 0. The contribution of Q$ satisfies 



lim / K(d m f 5 y = [ K(d m fY. 
In* Jn\r 



'n s Jn\i 
The main difficulty lies again in the contribution of IV Let us define r c by 

1 

t c 

The contribution of T$ can be computed if one can establish an estimate generalizing 



m 1 

d p c 



(|4.25|) according to 

5^K miP (d m f s (z)) - \[f}(x)\\ K (x)\^ m>d>a{x) (u) < u(5) 



(4.60) 



where u(5) — > as 5 — > 0, x G u G [—1, 1], z — x + 5un(x), and where the function 
& m ,d,k '■ [—1, 1] — > K only depends on m, d, k and <p. If (4.60) holds, we then easily derive 
that 

hrn^- 1 f K(d m fsY= [ C(a)\K\^\[f]r, 

with C(k) := J\ \<& mjC i,k(u)\ T du, which leads to the proof of the conjecture. 

We do not have a general proof of (4.60 ) for any m, p and d. In the following, we justify 



its validity in two particular cases for which the explicit expression of K miP is known to 
us : piecewise quadratic in two space dimensions (d = 2 and m = 3) and piecewise linear 
in any dimension (m = 2). 
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Piecewise quadratic elements in two dimensions. For all 5 > 0, x G T° and 

u G [—1,1], let ir Xt s >u G H 3 be the homogeneous cubic polynomial on IR 2 corresponding to 
d 3 f$(x + Sun(x)). Let also 7r XjU G H 3 be the homogeneous cubic polynomial on IR 2 defined 
by 

ir X}U (Xn(x) + fit(x)) = -\{<5>"{u)\ 2 - 3$' \u) k(x) u 2 ) (4.61) 



for all (A,/i) G IR 2 , where $ is defined by (4.17). For all x G T, we denote by M Xt s the 
(symmetric) linear map defined by 

M Xt sXi{x) = 5n(x) and M X) st(x) = V / 5t(x). 

Then, using a reasoning similar to the one used in the appendix of this chapter, it can be 
proved that 

\\ir x ,s,u ° M XjS - [f](x)n XjU \\ < u(6). (4.62) 

where \ims^ u(5) = and the function uj depends only on /. Furthermore, it is proved 
in Chapter [2] that for all q G H3 

K 3 , p (q) = C^|discq| 

where the positive constant C depends on p and the sign of discq. Combining this ex- 



pression with (4.62) proves (4.60) and thus the conjecture in the case m = 3 and d = 2. 



Piecewise linear elements in any dimension. We use the second fundamental form 
of the discontinuity set T in order to evaluate d m fs on F$. Characteristic functions are one 
of the simplest types of cartoon functions. In that case, it is possible to establish a simple 
relation between the second fundamental form of the edge set and the second derivatives 
of / in a distributional sense : if Q C IR d is a bounded domain with smooth boundary T 
and inward normal n, we then have for all C 2 test function ip and u, v G IR d 



= y r (w,n)^,n)(^-Tr(II)^) + ir( M ,^, (4.63) 

where H' x (u,v) is the second fundamental form ELj. applied to the orthogonal projection 
of u and v on T X T. The proof of this formula (that generalizes the simpler bidimensional 



case (4.70) which is proved in the appendix) is given further below. For all x G T, we 



denote by M Xj s the (symmetric) linear map defined by 

M x ^n(x) = 5n(x) and M x ^t = V~5t 

for all t G T X T. For all 5 > 0, x G T° and u G [—1, 1], let tx x> s, u G H 2 be the homogeneous 
quadratic polynomial on W 1 corresponding to d 2 fs(x + 8un(x)). Let also ir XiU G H 2 be the 
homogeneous quadratic polynomial on M. d defined by 

7r XtU (Xn(x)+t) = $'(m)A 2 - <S>(u)Tl x (t,t) 



for all A G IR and t G T X T, where $(x) := (p(x,y)dy. Then, using (4.63) and a 



reasoning analogous to the one presented in the appendix, it can be proved that 

hx,s,u M x,s ~ [f}(x)n X:U \\ < u(5). (4.64) 
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where lim^o ^(^) — and the function u depends only on /. Furthermore, it is proved 
in Chapter [2] that 

tf 2iP (q) = C^|detq| 

where the positive constant C depends on d,p and the signature of q G H 2 . Combining 



this expression with (4.64) proves the estimate (4.60) and thus the conjecture in the case 



m = 2 in any dimension d > 1. 



Proof of (4.63) : Let proj r be the orthogonal projection onto T, and for all x G Y 

let proj^. be the orthogonal projection onto T X Y. We consider a vector u G M. d and we 
define U : Y — > Y by U(x) := proj r (x + u). If \\u\\ ||II||z,° c (r) < 1, then U is smooth and its 
differential d x U : T X Y — > T X >Y, where 

x = XJ(x), 

is given by the following formula 

d x U = (Id-(«,n(a; / ))II^) _1 proj a: ' 

The determinant of d x ll (more precisely the determinant of the matrix of d x ll in direct 
orthogonal bases of T X Y and T X >Y) is 



det(4U) = det(Id-(M,n(x , ))II x .O _1 (n(x),n(x')) 



(u, n(a/)) Tr(II x /) + ||tt||a;i(M, x) 



where u\(u, x) tends uniformly to as u — > 0. Furthermore, it is easy to show that 



(x + u) — if)(x') — (u, n{x))d n ^)ip{x')\ < C\ 



u 



and ||n(x') —n(x) — H x /(j)Toj x ,(u))\\ < \\u\\uj2(u), where C and u 2 are independent of x G Y 
and u>2{u) — > as u — > (the quadratic form H x (r, s) := — (<9 r n(x), s) for all r, s G T X Y is 
identified here to the differential of n). Combining these results, we obtain 



ip(x + u)(n(x),v)dx 



ij){x + u)(n(x), v) det(d x > XJ U ) l dx' 
{n(x'),v)ip(x')dx' 



+ j (n(x'),v)(n(x'),u)(d nix/) ip - Tr(Ii;)^(a; / )) + (v, II , a ,(proj x ,(M)))^(x , )rfa; / 
+ \\u\\u 3 (u). 



where us(u) — > as u — > 0. We conclude the proof of (4.63) using the formula 



dl v ip = \imh 1 / (ip(x + hu) — tp{x))(n{x), v)dx. 



o 
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Remark 4.7.3. Similarly to the results presented in §4-4 < there is an affine invariance 
property associated to k :ifT is an affine transformation ofW 1 with linear part L, and if 
f = f o T, T = T(r) and k is the Gauss curvature ofT, then one has for any s > 0, 



(detL)a+r / |C((t)[/]| s |k|3+t 



\C(Z)[f~]\ s \K\ 



It follows from this observation that when p = p c , the contribution of the edges is affine 
invariant in the sense that 

E Pc (f) = (detL)wE Pc (f). 

Since one also has A mp (f) = (det L)^ 1 A mp (f) this comforts the conjecture. Let us men- 
tion that the quantity \K\ d + 1 has been used in J81 in order to define surface smoothing 
operators that are invariant under affine change of coordinates. 



4.8 Conclusion 

In this chapter we have investigated the quantity A p (f) which governs the rate of ap- 
proximation by anisotropic Pi finite elements as a way to describe anisotropic smoothness 
of functions. This quantity is not a semi-norm due to the presence of the non-linear quan- 
tity det(d 2 f) and cannot be defined in a straightforward manner for general distributions. 
We nevertheless have shown that this quantity can be defined for cartoon images with 
geometrically smooth edges when p < 2. A theoretical issue remains to give a satisfactory 
meaning to the full class of functions for which this quantity is finite. 

From a more applied perspective, it could be interesting to investigate the role of A p (f) 
in problems where anisotropic features naturally arise : 

1. Approximation of PDE's : in the case of one dimensional hyperbolic conservation 



laws, it was proved in 44 that despite the appearance of discontinuities the solu- 
tion has high order smoothness in Besov spaces that govern the rate of adaptive 
approximation by piecewise polynomials. A natural question is to ask wether simi- 
lar results hold in higher dimension, which corresponds to understanding if A p (f) 
remains bounded despite the appearance of shocks. 



2. Image processing : as illustrated in £4.5, the quantity A p (f) can easily be discretized 



and defined for pixelized images. It is therefore tempting to use A 2 (f) i n a similar 



way as the total variation in (4.2), by solving a problem of the form 



mm{A 2 (g) ; \\Tg - h\\ L , < e}, (4.65) 

gEBV 

with the objective of promoting images with piecewise smooth edges. The main 
difficulty is that A 2 is not a convex functional. One way to solve this difficulty could 



be to reformulate (4.65) in a Bayesian framework as the search of a maximum of an 
a-posteriori probability distribution (MAP) as an estimator of /. In this framework, 
we may instead search for a minimal mean-square error estimator (MMSE), and 
this search can be implemented by stochastic algorithms which do not require the 



convexity of A 2 , see 70 and £4.6 



4.9. Appendix : proof of the estimates {4.22^- (fOgD- f 4 . 24\) . 
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4.9 Appendix : proof of the estimates (4.22)-(4.23)- 



(4.24). 



It is known since the work of Whitney on extension theorems (see in particular |91| ) 
that for any open set U C M. d , and any g G C 2 (U) there exists g G C 2 (R d ) such that 
g\u = g. It follows that for each 1 < i < k, there exists /, G C 2 (IR 2 ), compactly supported, 
and such that fi\n. = /j. 

Let Tj be one of the pieces of T, between the domains flk and fli, and let s = fk and 
t — fi — ft- Although the domains Qk and Qi are only piecewise smooth, there exists an 
open set Q' with C 2 boundary such that for 5 > small enough 

f = sXw+t on |J (T^s + Bs), 

0<S<S 

where 13$ is the ball of radius 5 centered at 0. Note that Tj C V :— dQ' and that s = [f] 
on Tj. In the following, the variables x, z are always subject to the restriction 

x G r° 5 and z = U$(x, u) = x + 6un(x) where < 5 < 5 and \u\ < 1, (4.66) 

note that z G Tjj and ||x — z\\ < 5. We therefore have 



fs(z) 



s(x)(ps(z — x)dx + t$(z), 



where tg :—t * (pg. The second derivatives of tg are uniformly bounded, and are therefore 



negligible in regard of all three estimates (4.22), (4.23) and (4.24), indeed 



\\{d 2 t) * <Ps\\l°° < IM 2 ^||l°° Hv'islU 1 = ll^ 2 £|U°° II^IIl 1 < °°- 



We now define the 2x2 symmetric matrices 
I(z,x) : 

so that 



(s(x) — s(x)) d 2 (fs(z — x) dx and J{z) := / d 2 (ps(z — x)dx 



d 2 f s (z) = d% + I(z, x) + [f\{x)J{z). (4.67) 

We already know that the contribution of d 2 tg is negligible. We now prove that the same 
holds for the contribution of I(z,x). Since <£g(z — x) is non-zero only if \\x — z\\ < 5 and 
therefore \\x — x\\ < 25, we can bound the norm of the matrix I(z,x) by 



\I{ z i x )\\ ^ 25||ds||£oo H^^^ll^i < 25||ds||ioo ||d 2 = C5 . 



(4.68) 



This proves that the contribution of I(z,x) is negligible for the two estimates (4.22) and 



(4.23). In order to prove that it is also negligible in the estimate (4.24), we need a finer 
analysis of t(x) T I(z, x)t(x). For this purpose we fix a unit vector u and the pair (x,z). 
We introduce 



A(x) := (s(x) - s(x)) d u ip s (z - x) + d u s(x) <p s {z - x), 
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so that by Leibniz rule 

(s(x) - s(x)) dl >u ip s (z -x) = dl u s{x) <p s {z - x) - d u A(x). 

Therefore 



u I(z,x)u 



(8% >u s(x) ip & {z -x)- d u A(x)) dx, 



d uu s(x) <ps(z — x)dx — I A(x)(n(x),u)dx. 

CI' ' JV 



The first integral clearly satisfies 

dl u s{x) <f S (z-x)dx 



< \\d 2 s\ 



and is therefore bounded independently of S. We estimate the second integral for the 
special case u = t(x), remarking that \(n(x),t(x))\ < C\8 on the domain of integration. 
Therefore 



A(x)(n(x),t(x))dx 



<Ca*|r'nfl(M)|||A|| LO 



where, slightly abusing notations, we denote by |r ; fl B(z,5)\ the length (1-dimensional 
Hausdorff measure) of the curve T'r\B(z, 5). Clearly A(x) = if — x\\ > 5. If \\z — x\\ < 5 
we have 

\A(x)\ < {\\x-z\\ + \\z - x\\)\\ds\\ L o°\\dip\\ L °o5~ 3 + 11^11^ I^H^r 2 < C 5~ 2 . (4.69) 
Since in addition \V fl B(z, 5)\ < C28, we finally find that 



A(x)(n(x),t(x))dx 



< CqC\C2- 



We have therefore proved that 



t(x) T I(z,x)t(x)\ < C, 



where the constant C is independent of 5, which shows that the contribution of I(z,x) is 



negligible in (4.24) 



We now analyze the contribution the quantity [f](x)J(z) in (4.67). For this purpose, we 
use an expression of the second derivative of the characteristic function Xw of a smooth set 
Q' in the distribution sense. We assume without loss of generality that V is parametrized 
in the trigonometric sense, and therefore that n is the inward normal to Q. For all test 
function ip, we have 



- / 



d u ip(v,n) = / (d D ip{u,n) + d t ip{u,t))(v,n) 



and, by integration by parts, 

d t ip(u,t)(v,n) 



ip({u, nn){v , n) — (u,t)(v, Kt)). 



4.9. Appendix : proof of the estimates (4.22)-(4.23)-(4.24). 
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(4.70) 



(4.71) 



Therefore, we have 

Jw Jv 
Applying this formula to ip(x) '■— (fs{z — x) we obtain 

= /(,, n ( i ))M i ) )(8#1 (,- i )- t (* i (,- £ ))fi 
n(x){u, t(x))(v, t{x))(fs{z — x)dx. 

Since Tj is C 2 , there exists a constant Co such that for all Xi,x% G Tj, we have 

| (t(a?i), n(a7 2 )> | < C \\x 1 - x 2 \\, 

and 

|1 - (n(xi),n(x 2 ))| = |1 - (t(z 1 ),t(ar 2 ))| < Colki - x 2 \\ 2 - 
We finally remark that \T'nB(z, 5)\ < CiS, and that < || || ^oo<5 — 2 and || c^nV 9 ^ II i 00 — 



Taking the vectors t(x) or n(x) as possible values of u and v in (4.71) and using the 



above remarks, we obtain the estimates 



n(x) 1 J(z)n(x) + / d n ips{z — x)dx < C5 1 , 
r' 

|t(x) T J(^)n(x)| < OS' 1 , 



t(x) T J(z)t(x) 



K(x)(fs(z — x)dx 



< C, 



(4.72) 
(4.73) 
(4.74) 



where the constant C depends only on /. In view of (4.67) we can immediately derive 



estimate (4.23) from (4.73). 



In order to derive the estimate (4.24) from (4.74), we first introduce the modulus of 
continuity uj of K on Tj, 

oj(S) := sup \k(xi) — k(x2)\- 

Xl,X2&Tj ; \\xi— !T2||<<5 

Therefore 



t(x) T J(z)t(x) + / n(x)ips(z — x)dx 



< 



t(x) T J(z)t(x) + k(x) / <ps(z — x)dx 



We now claim that 



+Cu(5)5- 1 . 
(ps(z — x)dx — 5 _1 $(n) 



<C, 



(4.75) 



holds with C independent of 5 which implies the validity of (4.24). In order to prove (4.75), 
we use a local parametrization of T' : let A : R — > R be such that for h small enough we 
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have, x + ht(x) + X(h)n(x) E T. Note that we have \X(h)\ < C h 2 and \X'(h)\ < C h for 
h small enough. Then for S small enough, 



ifs(z — x)dx — 8 



< 



tp s (ht(x) + (5u - A(/i))n(x)) v / l + X'{h) 2 dh 



(fs(ht(x) + 8un(x))dh 
< C5(||^|Uoo( v /l + (C 5) 2 - 1) + \\cbp s \\ L «,C 5 2 ) < C 
Finally, we can derive the estimate ( 4.22[ ) from (4.72) using the inequality 



J d n (ps{z - x)dx + 5 2 $'(m) 



< cr 1 



(4.76) 



which proof is very similar to the one of (4.75). 



Part III 

Mesh adaptation and riemannian 

metrics 
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Chapter 5 



Are riemannian metrics equivalent 
to simplicial meshes ? 
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5.1 Introduction 

Triangulations and meshes are finite objects of combinatorial nature : they can be 
described by a collection of vertices and of connections between these. This description 
well adapted to the demonstration of algebraic results, such as the Euler formula, or for 
computer processing. In contrast, many approaches towards anisotropic mesh generation 
are based on a continuous object, namely a riemannian metric z i-> H(z), in other words 
a continuous function H from the domain Q C IR d to the set S£ of symmetric positive 
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Figure 5.1 - Two triangles T and their associated ellipse St- 



definite matrices. Once this metric has been properly designed, it is the task of a mesh ge- 



neration algorithm such as 15,66,93,94 to generate a triangulation that agrees with this 



metric, in a sense specified below (5.2). The purpose of this chapter is to formulate pre- 



cise equivalence results between some classes of triangulations and of riemannian metrics. 
This equivalence translates some geometrical constraints satisfied by the triangulations 
into the form of regularity properties of the equivalent riemannian metrics. 

Our results are so far limited to meshes and metrics defined on the entire infinite 
domain IR d , and the dimension d > 2 is fixed throughout this chapter. The choice of the 
unbounded domain M d is guided by simplicity, since the curvature and the singularities of 
the boundary of a bounded domain induce additional difficulties from the point of view 
of computational mesh generation. Bounded domains will be the object of future work. 
We denote by T the collection of conforming simplicial meshes of lR d , and we introduce 



in £5.1.2 three embedded subsets of T of particular interest 



-LC 



C T a , c C T 



where C > 1 is a parameter which plays a minor role. The set T^c collects all isotro- 
pic meshes T which are heuristically defined as follows : the simplices T 6 T may have 
strongly varying volumes, but their aspect ratio is uniformly bounded. The largest set 
T Si c collects graded meshes, which only satisfy a condition of local consistency : all the 
geometrical features of the simplices T G T may vary strongly, volume, aspect ratio and 
orientation, but two neighboring simplices should not be excessively different. The in- 
termediate set T a ,c of quasi-acute meshes, is defined by a condition which involves the 
measure of sliverness S(T) of a simplex T introduced in Chapter [3] and also discussed in 
Chapter [6} The measure of sliverness plays an important role in the finite element ap- 
proximation of a function on a mesh T when the error is measured in the Sobolev W l,p 



norm. 



We associate to each simplex T a symmetric positive definite matrix Ht such that the 
ellipsoid 

S T := {z G R d ; {z- z T )U T {z - z T ) T < 1}, (5.1) 
which is centered at the barycenter z? of T, is the ellipsoid of minimal volume containing 



T. Two triangles T and their associated ellipses St are illustrated on Figure 5.1 We give 



in £5.1.1 the explicit expression of the matrix Ht in terms of the coordinates of vertices 



of the simplex T, and we discuss its main properties. The matrix Ht G encodes the 
volume, the aspect ratio and the orientation (but not the angles) of the simplex T. 

A riemannian metric on IR d is a continuous map H which associates to any z G M. d a 
symmetric positive definite matrix H(z) G Sj[. We denote by H = C°(lR d , S£) the col- 



5.1. Introduction 



207 



lection of riemannian metrics. The following definition introduces a notion of equivalence 
between meshes and metrics 

Definition 5.1.1. We say that a mesh T G T is Co-equivalent to a given metric H G H ; 
where Co > 1 is a fixed constant, if for all T G T and all z G T one has 

C 2 H(z) <U T < ClH(z) (5.2) 

We say that a collection of simplicial meshes T* C T is equivalent to a collection of 
metrics if there exists a uniform constant Co > 1 such that the following holds : 

- For any mesh T G T* there exists a metric H G H* such that T and H are Co- 
equivalent. 

- For any metric H G H* there exists a mesh Tg T, such that T and H are Co- 
equivalent. 



Our main result (Theorem 5.1.14) states that when the constant C is sufficiently large, 
and if dimension is d — 2, the three classes of meshes T^c C T aj c C T g ,c are respectively 
equivalent to three classes of metrics 

H, : C H„, C H„. 



that are defined by precise smoothness conditions on the function z h> H(z) in £5.1.3 
The most striking point of this result is that, although the matrix Ht does not encode 
the angles of a simplex T, the fact that a mesh T G T a ,c is quasi-acute can be translated 
in a precise regularity condition on the metric H equivalent to T. 

In order to state our results more precisely we need to introduce some notations. 
We first recall some notations of linear algebra. We denote by the collection of d x d 
matrices with real entries. We denote by GL d the standard linear group, by SL rf the special 
linear group and by Od the orthogonal group 



{A G M d ; det A ^ 0} 
{A G M d ; det A = 1} 
{U G M d ; U T U = Id}. 



We denote by S d the collection of symmetric matrices, by S® the subset of non-negative 
symmetric matrices, and by S'j" the collection of symmetric positive definite matrices. 



S d 

ce 



{M G M d ; M = M T } 

{M G S d ; z T Mz > for all z G 

sf n GL d . 



For any M, M' G S d , we write .\/ < M' if M' - M e Sf, and M < M' if M' - M G S+. 
We recall that for any M, M' G S d such that M < M' and any G M d we have T M0 < 
T M'0. Furthermore if M, M' G S'j" and M < M' then M /_1 < M _1 . For each symmetric 
positive definite matrix M G Sj" we define a norm || ■ \\m on IR d as follows : for all u G M d 

II^IIat := m t Mm. 
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5.1.1 Geometry of the simplex 

We briefly recall the definition of a d-dimensional simplex, and of the collection of its 
faces. 

Definition 5.1.2. A d- dimensional simplex T is the convex envelope of a set V dW 1 of 

d+1 vertices, not contained in a d — 1 dimensional affine subspace ofWL d : 

T = Cvx(V). 

We denote by J-'(T) the collection of faces of T of any dimension, 

J-(T) := {Cvx(y') ; V C V}. (5.3) 

Observe that and T are among the faces of a simplex T, since they respectively 
correspond to V — and V = V. We denote by zt G M. d the barycenter of a simplex T 
of vertices V 



and we define a symmetric positive definite matrix Ht £ as follows 

N (l! 7rr^\ln 7.r. 

d + 



Ut 1 = ^-Y.( v - z ^-zt) T . (5.4) 



The next proposition shows that this definition is consistent with the characterization of 
1-Lt given in (5.1), and gives some of the key properties of the matrix Ht- Throughout 
this chapter, we denote by T eq a d-dimensional equilateral simplex, centered at the origin 
i.e. such that 2r cq — 0, and having its vertices on the unit sphere. One easily checks that 

n Tcq = id . 

Proposition 5.1.3. The following holds for any d-dimensional simplex T. 

- For any an affine change of coordinates $ ; $(z) := <pz + zq where <fi G GL^ and zq 
in IR d , we have 

U$-i(T) = <j) T n T <l>. (5.5) 

- There exists a rotation U G Od, depending on T, such that 

ul{T-z T ) = U{T C(l ) ) (5.6) 

hence 

\T\y/detH T = (5.7) 

- We have the inclusions 

{z G R d ; \\z - z T \\ nT < 1/d} C T c £ T ■= {z G R d ; \\z - z T \\ nT < 1}, (5.8) 

and the these two ellipsoids are respectively the one of largest volume included in T, 
and of smallest volume containing T. 
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Proof: For any vertex $ l (v) of the simplex $ 1 {T), we have 

- z$-i(T) = - ST 1 ^) = - z T ). 



Injecting this relation in (5.4) we obtain (5.5). We now turn to the proof of (5.6) and for 



that purpose we remark that the simplices T eq and T" := T-L T (T — Zt) both have their 
barycenter at zero, hence there exists a linear map <fi G GL^ such that T' = </> -1 (T eq ). We 



thus obtain, using the formula (5.5) for the transformation of T-Lt under affine change of 
coordinates 

_ i _ i 
Ht 1 = ^H-rp 2 Hj-l-ij, 2 = Id 

(T oq ) = ^ T ^1 



it 



Hence T = Id which establishes that <fi G and concludes the proof of (5.6). It is 
well known that the smallest ellipsoid containing T eq is the unit ball, and that the largest 
ellipsoid included in T eq is the ball of radius 1/d. Combining this fact with the change of 
coordinates (5.6) we obtain (5.8) which concludes the proof of this proposition. o 



Note that for any simplex T and any z, z' G T we obtain using (5.8) 
\\z - z'\\ Ht < \\z - z t \\ Ht + \\z' - z t \\ Ht < 1 + 1 = 2, 



and 



d 



\ri T 2 \\ < diam(T) < 2\\% 



We introduce a measure of degeneracy p(T) G [1, oo) of a c?-dimensional simplex T 



P(T) :-- 



\K T X \ 



(5.9) 
(5.10) 

(5.11) 



Note that p(T) 



of (5.6) that T 



1 if and only if T-Lt is proportional to the identity, which implies in view 
zt + rU for some r > and U G O^- 
The measure of degeneracy p is slightly different from the measure of degeneracy 
dmr ^p used in Chapters |2j and jij but both have the same role : they are minimal for 
equilateral simplices, and increase as the simplex becomes thinner. 



The measure of sliverness S(T) of a simplex T is a quantity which plays an important 
role in the context of optimal mesh adaptation for the finite element approximation of 
a function in the Sobolev W 1)P norm, see 
of sliverness is defined in Chapter [3] and illustrated by several examples and equivalent 
quantities. The measure of sliverness is defined by 

S(T) := infiJI^IIH^- 1 !! ; G GL d and 0(T) is acute }. (5.12) 

We recall that a simplex is acute if and only if the exterior normals n, n' to any two 
distinct d — 1-dimensional faces have a negative scalar product (n, n') < 0. As observed 
in Chapter [3j the measure of sliverness S(T) can be interpreted as the distance from T 
to the collection of acute simplices. 



10 62 and Chapters and m The measure 
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The exterior normals n, n' to any two distinct d — 1-dimensional faces of T eq satisfy 



(n, n') = —1/d, hence this simplex is acute. Recalling (5.6) we thus obtain for any simplex 
T 

S(T) < p{T). 

Apart from this upper bound, the matrix Ht does not contain any direct information 
on the measure of sliverness S(T). For any bidimensional triangle T of largest angle 9, 
Proposition |3.2.1| of Chapter [3] states that : 



S(T) = max j 1, tan - 

5.1.2 Simplicial meshes 

As a starter, we recall the definition of a conforming simplicial mesh of M. d . 

Definition 5.1.4. A conforming simplicial mesh o/IR d is a collection T of simplices which 
satisfy the conformity axiom : for all T,T' G T 

TnT' G F(T) n .F(T'), (5.13) 

as well as the following properties 

- (Covering) The simplices T G T cover the whole infinite domain lR d : 

|J T = R d . 

TeT 

- (Partition) The interiors of the simplices T G T are pairwise disjoint : for any 
T, T g T, 

int(T) n int(T') ^ implies T = T . 

- (Local finiteness) For any compact set K C M, d , only a finite collection of simplices 
T G T intersect K : 

{T G T ; K n T ^ 0} zs /m^e. 



The conformity axiom ( 5.13[ ) states that the intersection of any two simplices in T 



needs to be a full common face. We denote by T the collection of conforming simplicial 
meshes of lR d . 

For an optimal efficiency, numerous numerical simulation software use anisotropic 
meshes, in which the simplices may have an arbitrary shape. Anisotropic meshes are 
used for instance, to create a thin layer of simplices close to a geometric feature which is 
relevant in a numerical simulation. In order to avoid excessively wild meshes, one often 
requires some consistency between the shapes of neighboring simplices. We therefore in- 
troduce for any constant C > 1 the class T 5 c C T of graded meshes, which is defined as 
follows. 

Definition 5.1.5. A mesh T belongs to T 9i c* if for any two simplices T,T' G T one has 

T n TV => C- 2 H T < U T > < C 2 H T . (5.14) 
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Figure 5.2 - Quasi-acute triangulation (left) and refined triangulation with uniformly 
bounded sliverness (right). 



For any C > 1 we also introduce the class T^c of isotropic meshes, by requiring in 



addition to (5.14) that the measure of degeneracy p is uniformly bounded by C, which 



forbids any anisotropy. 

Definition 5.1.6. A mesh T belongs to T^c if and only ifT & T g ,c and for all T e T 

p(T) < C. 

In order to introduce the intermediate collection T a c of C-quasi-acute meshes, we 
first define the notion of refinement of a simplicial mesh. 

Definition 5.1.7. Consider two meshes T,T' £ T and a constant C > 1. We say that 
T' is a C -refinement ofT if it satisfies the following properties. 

- (Inclusion) Any simplex X" G T' is contained in a simplex T G T. 

- (Bounded refinement) Any simplex T G T , contains at most C simplices T' G T. 

Definition 5.1.8. A graded mesh T G T 5i c belongs to the collection T Ut c of quasi-acute 
meshes if and only there exists a C-refinement T' of T which satisfies for all T' G T 1 

SiT 1 ) < C. 

In particular if T G Ti,c, then choosing T' = T, which is a C-refinement since C > 1, 
and recalling that S(T) < p(T) for any simplex T, we obtain that T G T a ,c- Thus 
Tj t c C T a ,c C T 9i c as announced. 

The constraints defining graded, quasi-acute and isotropic meshes are illustrated on 



Figure [8] in the main introduction of the thesis. Figure |5.2| gives an example of a quasi- 
acute triangulation which by the above described refinement process leads to a slightly 
finer triangulation on which the measure of sliverness S(T) is uniformly bounded. 



5.1.3 Riemannian metrics 

For any metric H G H := C°(R d , S£) and any path 7 G C 1 ([a, b], M d ), we define the 
riemmannian length Ih{i) by the integral 



l H {l) ■- 



1 



H( 7 (t)) 



dt. 
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The riemmannian distance dn between two points z, z' e R d } is the infimum of the length 
of all paths joining z and z' , 

d H (z,z') := inf {l H (j) ; 7 G C 1 ([0, l},R d ) , T (0) = z and T (l) = z'} . (5.15) 

If H, H' G H are such that H(z) < H'{z) for all z G R d , then d H (z, z') < d H >(z, z') for all 
z, z' G R d . By construction, the riemannian distance dn associated to a metric H G H is 
locally equivalent close to a point z to the distance defined by the norm || • \\h(z) '■ for all 
zeR d 

lim sup I lndff(p, q) — In \\p — q\\ii(z)\ = 0, (5.16) 
where B(z,e) denotes the euclidean ball of radius e around z. 



Remark 5.1.9 (Riemannian geodesies). When a metric H is sufficiently smooth, a special 
family of curves named Riemannian geodesies can be defined and studied. A riemannian 
geodesic is a curve 7 G C°(R, M, d ) which satisfies a second order differential equation called 
the equation of geodesies. This equation implies that for any sufficiently close t,t' G R 
the curve 7|[t,f] is the path of smallest possible length joining the points 7(i) and 7(t') .' 
d H ( 1 (t), 1 (t')) = l H ( lim ). 

Geodesies are not defined for general continuous or Lipschitz riemannian metrics, such 
as those constituting the sets H and Hj C H a C H g , since the coefficients of the equation 
of geodesies involve the second derivatives of the metric. Notions related to geodesies, such 
as the injectivity radius of a metric, therefore have no meaning in our context. This is not 
an issue for our purposes since, following the point of view of i 58 1, we study Riemannian 
distances and not Riemannian geodesies. 

Definition 5.1.10. We denote by H« the collection of metrics H G H which are propor- 
tionnal to the identity : for all z G M. d 

HU) = - W 



s(z) 2 ' 



and such that the proportionality factor s satisfies one of the two following conditions 
conditions, which are surprisingly equivalent as shown in Proposition \5.2. 7| : 

- ('Additive" Lipschitz condition) 

\s(z) - s(z')\ < \z - z'\ for all z,z'eR d . (5.17) 

- ("Multiplicative" Lipschitz condition) 

I In s(z) — In s(z') | < dn{z, z') for all z, z G M. d . (5.18) 



The two Lipschitz properties ( 5.17[ ) and (5.18) have a natural generalisation to general 



anisotropic metrics but are not equivalent in that context. In order to introduce the 
regularity conditions defining the classes H a and H 5 of metrics, we need to introduce 
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two distances d x and d + on the set S'j" of symmetric positive definite matrices. For any 
M, M' G Sj we define 



d x {M,M') : = min{<5 > ; e~ 25 M < M' < e 2d M}. 



,2S 



(5.19) 



For instance, a mesh T is C-equivalent to a metric H, as defined in (5.2), if and only if 
for all T G T and all 2 G T one has 

d x (H(z),n T ) <lnC. 

There exists various expressions of the distance <i x : for all M, M' G S% one easily checks 
that 

d x (M, M') = sup | In \\u\\m — In ||m||m'| 

«eR d \{o} 



sup | In \\u\\m — In IM|m' 



and 



d x (M,M') = In fmax{ || ||, \\M'-^M^ 



(5.20) 



(5.21) 



This last expression implies that 



d x {M,M') > -max{|ln||M|| -ln||M'|||, | In HA^T" 1 !! - In ||M /_1 1| |} (5.22) 
2 



The exponential of the distance d x was used in the earlier paper 66 under the name 

?fine a secc 

d+(M, M') 



"relative deformation". We define a second distance d + on as follows 



lAf-2 -M'~h 



(5.23) 



The expressions (5.21) and (5.23) of the distances d x and d + show that they are res- 
pectively tied to multiplicative or additive properties of matrices, which justifies their 
notations. Note also that for any s, s' > one has 



d x (s Id, s' Id) = | Ins — Ins' | and d + (s Id, s' Id) = |s — s 



(5.24) 



Definition 5.1.11. We denote by H g the collection of metrics H G H which satisfy the 
natural extension of (5.18) to general anisotropic metrics : 



d x (H(z), H(z')) < dn(z,z') for all z,z' G 



(5.25) 



This condition can be heuristically described as follows : a metric H G H defines 
close to each point z G M. d a norm || • \\h(z) which encodes an anisotropic notion of scale. 
The equation (5.25) states that the metric H itself is consistent a the scale that it encodes. 



Definition 5.1.12. We denote by H a the collection of metrics H G H which satisfy both 
(5.25) and the natural extension of (5.17) to general anisotropic metrics : 

d+(H(z),H(z')) <\z- z'\ for all z, z G M. d . (5.26) 
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In addition to the results of mesh generation presented in this chapter, we show in 



the next chapter, Lemma 6.4.1 that this condition is critical in order to define a local 
averaging operator compatible with Sobolev norms. 

Remark 5.1.13. Let H G H, and let A > 0. Using the homogeneity properties of the 
distances d x , d + on and du on IR d with respect to H , we obtain that \ 2 H G H g if and 
only if 

d x (H{z), H{z')) < \d H {z, z') for all z, z' G R d , (5.27) 
and X 2 H G H a if and only if we have in addition to the previous property 



d+(H(z),H(z')) < X\z — z'\ for all z, z G 



(5.28) 



In particular the collections Hj C H a C H g of isotropic, quasi-acute and graded metrics 
are stable under multiplication by a constant larger than one : for any * G {a,b,c}, any 
H G H* and any A > 1 we have X 2 H G H*. 



5.1.4 Main results 

The main result of this chapter is the equivalence of the classes of meshes and metrics 
defined above, which is announced in the first part of the introduction and proved in 



sections §5.4| and §5.5| In two cases we were only able to establish this equivalence when 
the dimension is d = 2. 

Theorem 5.1.14. There exists Cq = Co(d) such that for all C > Co the following holds. 

i) The collections T it c of triangulations and H, of metrics are equivalent. 

ii) If d = 2, then the collections T a ,c of triangulations and H a of metrics are equivalent. 
Hi) If d = 2, then the collections T g ,c of triangulations and H g of metrics are equivalent. 

Let us comment on this result. The theory of isotropic meshes is already well developed, 
and our result i) in this direction should be regarded as a reformulation of previous work, 
intended to put into perspective the results on anisotropic meshes. The main ingredient 



of the proof of i) is the mesh refinement procedure exposed in 77 



In the case iii) of graded metrics the key ingredients used for the construction of 
a bidimensional triangulation T G T g ,c equivalent to a metric H G H g come from the 
paper of computational geometry [66]. In contrast the arguments used to produce a metric 



H G H g from a mesh T G H g ,c hold in any dimension. 

Last the author has not heard of any anterior results on the collection T a C of quasi- 
acute meshes, and most of the techniques used in that context are new. 



Conjecture 5.1.15. The author conjectures that the points ii) and iii) of Theorem 5.1.14 
hold without restriction on the dimension d > 2. 



This chapter is organised as follows. We establish in section ^5.2 some general pro- 



perties of the metrics in H a or H g . We first prove in £5.2.1 a property of invariance of 



the collection H„ of graded metrics with respect to affine changes of coordinates. The 



regularity assumptions (5.25) and (5.26) defining graded and quasi-acute metrics simply 
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mean that the map H : M. d — > is Lipschitz with respect to some distances on IR d and 
and we therefore study metrics from this point of view in £5.2.2 We focus in £5.2.3 
on the "geometrical properties" of the space M. d equipped with the distance dn and the 
measure -^/det H(z)dz associated to a graded metric H G H s . In particular we compare 
the dn with the euclidean distance and we estimate the volume of balls. 

Section £5^3 is devoted to the study of metrics H G H such that the symmetric matrix 
H(z) has an eigenspace of dimension at least d — 1 at each point. This condition clearly 
holds if the dimension is d — 2 but is also relevant in some applications to higher dimension 
such as the creation of a thin layer of simplices close to a d — 1 dimensional surface, as 
discussed in §6.3 in the next chapter. We give an approximate translation of the regularity 
properties (5.26) and (5.25) defining the collections H a and H 9 of metrics in terms of the 
two eigenvalues of H and of the direction of the eigenspaces of such metrics. 



The last two sections £5.4 and £5.5 are devoted to the proof of Theorem 5.1.14 More 
precisely we depict in £5.4 to construction of a metric which is equivalent to a given mesh. 
Conversely we address in £5^5 the production of an isotropic, quasi-acute or graded mesh 
equivalent to a given metric H G H i; H a or H„. 



5.2 General properties of graded and quasi- acute me- 
trics 

We study in this section the general properties of the distinguished classes of metrics 
Hj C H a C Hg. We first establish some invariance properties of these collections of 
metrics with respect to affine changes of coordinates. We then regard metrics as Lipschitz 
functions H : IR d — > S^. The last subsection is devoted to the comparison of the distance 
dn and the measure a/ det H(z)dz associated to a graded metric H G H 9 , with respect to 
the standard euclidean distance and Lebesgue measure. 



5.2.1 Invariance properties and restriction of metrics 

We focus in this subsection on the properties of invariance of the collections of meshes 
Tj t c C T a ,c C T 5 c and of metrics Hj C H a C H g introduced in this chapter. 

Let $ : lR d — > M. be an affine change of coordinates, $>(z) := <pz + Zq where G GL^ 



and Zq G Mr. We have for any simplex T, as previously observed in (5.5), 

H*-ht) = T %T0- (5.29) 

Mimicking this property we define for any H G H a transported version Hq, G H as 
follows : for all z G M d 

H*(z) = <j) T H(<S>(z))(f). (5.30) 
In the language of differential geometry, the metric Hq> is generally referred to as the "pull 



back" of H by $. If a mesh T G T is C-equivalent to a metric H G H, then (5.29) and 



(5.30) clearly imply that the mesh <3> -1 (T) is C-equivalent to the metric H$. 

The next proposition studies the invariance of the collections T^c of meshes and T* 
of metrics, * G {i, a, g}, under the transformations T h-> $ _1 (T) or H i— > H$ induced by 
an affine change of coordinates $. 
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Proposition 5.2.1. Let <3> : M. d — > Mr be an affine change of coordinates, $(z) := <pz + Zo 
where <p £ GL^ and z G M d . The following holds for any constant C > 1. 

i) IfTe T fM c then $- x (T) G T fliC . IfHeUg then H$ G H 9 . 

iz ^ Assume that $ is an isometry, in other words <p e Oj- 

IfTe T ajC toen $ _1 0T) G T a>C; and similarly if T G T i)0 toen $ _1 (T) G T i)C . 
If H G H a i/ien G H a; and similarly if H G Hj iaen G Hj. 

Proof: We first establish the properties announced for meshes, and we then focus on 
metrics. Let M, M' G Sj, let C > 1 and let G GL d . Clearly 

if C~ 2 M < M' < C-M then C V .Wo < o' .W'o < f'V.Wo. 



In view of the definition (5.14) of T 9i c we obtain that $ HT) G T 9i c for any mesh 
T G T g c as announced in i). 



If $ is an isometry then for any simplex T we obtain, using the definitions (5.11 ) and 



(5.12) of the measure of degeneracy p and the measure of sliverness S, that 

p($- 1 (T)) = p(T) and S(*-\T)) = S{T). 

This implies the properties of T a ,c and T^c announced in b), which concludes the proof 
of the properties announced for meshes. 

We therefore turn our attention to metrics. The fact that H$ G H g for any H G H g 



and any affine change of coordinates $ is a special case of Proposition 5.2.2 below. We 
thus admit this property and we focus on isotropic and quasi-acute metrics and isometric 
changes of coordinates. 

If H G Hj, then H = ld/s(z) 2 where s : M d — > is a Lipschitz function. Since 
(f) G O d we have T = Id and therefore 

H 9 (z) = (j) T H(<S>(z))(j) = Id/s($(z)) 2 . 

The map s o $ : R d — y M* is Lipschitz since it is the composition of a Lipschitz function 
with an isometry, which establishes that Hq, G Hj as announced. 

Since (f> G Od we have for any symmetric matrix M G Sj and any exponent a G M 

(0 T M0) a = (j)M a (j). 

Consider H G H a , then for any z, z' G M d 

\\H^z)-^ - H^z'YH = U T [H(1>(z))-^ - Hmz'))- 

= \\H(^(z))-^ - H(^(z'))-^\\ 

< \Mz)-*w\\ 

z-z')\\ = \z-z'l 



which establishes that H$ satisfies the Lipschitz regularity condition (5.26). We already 
know from the case of graded metrics that the other regularity condition (5.25 ) is satisfied 
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by H$ } which establishes that H<& G H a and concludes the proof of this proposition. 



o 



In practical applications, one often needs to mesh simultaneously a domain Q and a 
surface T embedded in Q. In terms of metric this raises the following question : given a 
metric H on Q which satisfies certain properties of regularity what is the regularity of the 
restriction of this metric to F ? 

The next proposition answers this question is the simplified setting where Q is the 
infinite domain M. d , T is an affine subspace of R d , and properties of regularity are those 
defining the collection H 9 of graded metrics. For any 1 < k < d we denote by H g (lR fe ) 
the collection of graded metrics on R fc (with the convention that H fl (R) is the collection 
Lip(R, W + ) of Lipschitz functions from R to R* + ). 

Proposition 5.2.2. Let 1 < k < d, let $ : R fc — >• R d be an affine injective map, &(z) = 
(pz + zq where <f> G Md,k has rank k and zo G R d . Let H G H and let for all zeR^ 

IfHeUg then H$ G H 9 (R fc ). 
Proof: For any u G R d , one has 



Hence for any z, z' G R k we obtain using (5.20) 



ln II <I>(U) II H(*(z)) - ln II <K M ) II HW*>)) 



= sup 

u£TR k \{0} 

< d x (H($(z)), H($(z'))) 



For any path 7 G C 1 ([0, l],R fe ), it follows from (5.31) that 



ll0(T'(*))IU($(7(t)))^ = ^»(t)- 



Taking the infimum among all paths 7 G C 1 ([0, l],R fe ) joining two points z,z' G 
obtain since $07 g C 1 ([0, 1], R ) is a path joining $(2;) and $(2;') 

d H mz),<S>(z'))<d H0 (z,z'). 



(5.31) 



(5.32) 



Combining (5.32) and (5.33), we conclude the proof of this proposition. 



K , we 

(5.33) 

o 



5.2.2 Metrics as Lipschitz functions 

Definition 5.2.3. Let (X,dx) and (Y, aly) be metric spaces and let f G C°(X,Y) be a 
continuous function. We define the dilatation dil(/) G [0, 00] of f as follows 

dil(/) := sup — 

x,x'ex d x {x,x') 

x^x 1 
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We say that / is Lipschitz if and only if dil(/) < 1, and more generally that / is 
A-Lipschitz if and only if dil(/) < A. 

We define the local dilatation di\ x (f) of a function / G C°(X, Y) at a point i6las 
follows 

da.(/):=limdU(/| fl( ^))=liinf snp Mm,M) \ (5 M) 

\p,qeB(x,s) d x (p,q) J 

We specify the distance function on X or Y if it is not the expected "canonical one", 
for instance if X or Y is a subset of lR fc and if the associated distance is not the standard 
euclidean distance but the riemannian distance du associated to a metric, or if X or Y is 
the collection 5j~ of symmetric positive definite matrices since none of the two distances 
d + or d x is canonical. The local dilatation is in that case denoted as follows 

diL,(/; d x ,d Y ), dH x (f; d x ) or dil x (/; d Y ). 

The local dilatation dil x (f) only depends on the local properties of the metric spaces 
close to x and f(x). Consider the metric space (R d , dn), where H G H is a fixed riemannian 
metric, and an arbitrary metric space (Y, dy)- For any / G C°(IR a! , Y) and any z G M, d we 
obtain using (5.16) 

\\H{zY l \\-^ dil,(/) < dil,(/ ; d H ) = dil z (/; || ■ \\ H{Z) ) < Vfff5)]fdil z (/) (5.35) 
We also introduce the lower local dilatation dil*(/) < dil z (/) which is defined as follows 

dii :( / ): =li m f inf 

\ P ,q&B(x,e) d X (p,q) J 

The next proposition establishes that local dilatations are sub- multiplicative. 

Lemma 5.2.4. Let (X,dx), (Y,dy) and (Z,dz) be three metric spaces, and let f G 
C°(X,Y) and g G C°(Y, Z). For any x G X one has 

di\ x (gof)<di\ f{x) (g) di\ x (f), (5.36) 

provided no indeterminate product x oo orooxO appears in the right hand side. Similarly 
we have 

di\ x (gof)>dir f{x) (g)di\ x (f), (5.37) 
provided no indeterminate product x oo or oo x appears in the right hand side. 



Proof: We first establish (5.36). If di\ x (f) = oo or dilf( x )(g) = oo, then there is nothing 
to prove. We may therefore assume that dil x (/) < oo and dilf( x \(g) < oo. For any e > 
we define 

771/ x d Y (f(p),f(q)) d z (g(p),g(q)) 

F[e) = sup — — r and G(e) := sup — — — r — , 

P ,qeB(x,e) d x {p,q) p,qeB(f(x),s) d Y {p,q) 

we thus have 

gup W/(P))^(/(g))) < g(F(e)e)J r (e) . 

p,q€B(x,e) d X {P,q> 
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Taking the limit as e — > we obtain conclude the proof of (5.36). For the proof of 
(5.37) we consider two sequences (p n )n>o, (<?n)n>o which both tend to x and such that 
dvifiPn), f{q n ))/dx(Pn, Qn) -> dil^ (/) as n -> oo. We thus have 

dz(g(f( P n)),9(f(qn))) d z (g(f(p n )),g(f(q n ))) dy(f(p n ), g(f(q n ))) 



dx(p n ,q n ) 



dY(f(Pn)J(q n )) 



dx(Pn,q n ) 



and taking the limit as n — >■ oo we obtain (5.37) which concludes the proof of this lemma, o 

Let / : Q — > V be a C l function, where Q C M d is an open set and V is a Banach 
space. Then for any z G f2 the local dilatation dil 2 (/), and the lower dilatation dil*(/) 
have an explicit expression in terms of the differential d z f : M. d — > V of / at z 



dil z (f) = sup and dil* (/) = inf \\d z f(u) 

\u\=l \ u \ =1 



(5.38) 



Applying Lemma 5.2.4 and (5.38) to the function g = In we obtain for any metric space 
(X, d x ), any / G C°(X, R* + ) and any x G X 



diU(ln/) 



dil x (f) 



(5.39) 



The following proposition shows that, under certain circumstances, the Lipschitz pro- 
perty is a local property. 

Proposition 5.2.5. Consider the metric space (]R d ,(i^) ; where H G H is a fixed rieman- 
nian metric, and an arbitrary metric space {Y^dy). For any f G C (M. d ,Y) and any path 
7 G C\[0, l],R d ), 7(0) = z, 7(1) = z', one has 



It follows that 



d Y (f(z),f(z')) < l H (l) sup dil 7 ( t) (/; d H ) 

0<t<l 



dil(/; d H ) = sup di\ z (f; d H ). 



(5.40) 



(5.41) 



Proof: We equip the segment [0, 1] with the distance d 1 defined for < t < t! < 1 by 

d 7 (t,f) = d r (f,t) = l H (rf\[t,v])- (5-42) 
Note that for any t < t* <t' one has 

d 7 (t,t') = d^(t,U) + d 7 {U,t') (5.43) 



The map 7 : ([0, 1], c? 7 ) — > (R d , djj) is clearly Lipschitz. It thus follows from Lemma 
that F := f o 7 : [0, 1] Y satisfies for all t G [0, 1] 



5.2.4 



dih(F) < dil 7{4) (/) < A := sup dh>)(/). 

0<t'<l 
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We assume that A < oo, otherwise there is nothing to prove, and we consider a fixed 5 > 0. 
For each t G [0, 1] there exists therefore an interval V t C [0, 1] containing t, relatively open 
in [0, 1], and such that F\ Vt is (A + <5)-Lipschitz. Since the segment [0, 1] is compact, there 
exists a finite set Iq C [0, 1] such that 



[0, 1] 



IK 

teio 



(5.44) 



Let V, V C [0, 1] be two intersecting intervals, and assume that F is A + 5 Lipschitz on V 
and on V . For any t G V and if G V, there exists £* G V D V which satisfies t < t* <t' 



or t > t* > Recalling (5.43) we thus obtain 



dy(F(t),F(t')) 



< d Y (F(t),F(Q) + d Y (F(Q,F(if)) 

< (X + 8)d 1 (t,U) + (X + 8)d 1 (U,t') 
= (A + 5)d y (t,t'). 



Thus .F is A + 5 Lipschitz on the interval V U V . Since [0, 1] is covered by the finite 



collection of open intervals (5.44) on which F is A + 5 Lipschitz, we obtain proceeding by 



induction that F is (A + 5)-Lipshitz on [0, 1] and therefore 

d Y (f(z), f{z')) = d Y (F(0), F(l)) < (A + S)dJ0, 1) = (A + S)l H (rf). 



Letting 5 — > we obtain (5.40). Taking the infimum of ( |5.40 ) among all paths 7 G 
C 1 ([0, l],IR d ) joining the points z and z' we obtain dy(f(z), f(z')) < Xdn(z, z'). Therefore 
dil(/; djj) < A. Conversely we have diL(/; dn) < dil(/; du) for any z G M d , which esta- 
blishes (5.41) and concludes the proof of this proposition. 



The next corollary shows that the global dilatation of a function is controlled by the 
local dilatation on R d \ T, if F is a sufficiently thin set. For instance the skeleton UrerdT 
of a mesh T G T. 

Corollary 5.2.6. Let T C M. d be a set which has the following property : for any path 
7 G C 1 ([0, there exists a sequence of paths (7 ra ) n >o which converges to 7 in C 1 norm 

and such that {t G [0, 1] ; j n (t) G T} is finite for each n. 

Let H G H, let (Y,dy) be an arbitrary metric space, and met let f G C°(M. d ,Y) . Then 

dil(/; d H ) = sup dil z (/; d H )- 



Proof: We denote A := sup zgRd \ vr dil z (/). We consider two points z, z' G M. d and a path 
7 G C\[0, l},R d ) such that 7(0) = z and 7(1) = z' . 

We first assume that the set {t G [0, 1] ; 7(t) G T} is finite, and we denote its elements 
by t\ < ■ ■ ■ < tfc. We set by convention t = and tk+i = 1. For each < i < k and 
each t, t' such that U < t < if < t i+ i we have dy(/(7(t)), f(j(f))) < Aifl-(7|[t,f]) according 
to Proposition 5.2.5, Letting t U and £' — >■ t i+i we obtain dy(/(7(£i)), /(t(^+i))) < 



Aiif (7[[ti,t i+ i]) • Adding up we obtain 

d Y (f{z),f(zf)) < dy(/(7(*i)),/(7(**fi))) < A ^ ^(7[ ti ,t i+l] ) = A/ ff ( 7 ). 



0<i<fc 



0<t<fc 
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We now consider the case where 7 meets the set T an infinite number of times, and 
we choose a sequence (7 n )n>o as described is the statement of the corollary. We obtain 
dy (/(7n(0)), /(7 n (l))) < Xlniln) for each n > 0. Taking the limit as n — > 00 we obtain 
dy(f(z), f(z')) < \Ih(i) since j n converges to 7 is C 1 norm as n — > 00. 

Taking the infimum among all paths 7 G C 1 ([0, l],M d ) joining the points z and 2', we 
obtain dy(f(z), f(z')) < \cIh(z,z') which concludes the proof of this proposition. o 



The next proposition establishes that the regularity conditions (5.25) and (5.26) defi- 
ning the collections of quasi-acute or graded metrics are equivalent in the case of a metric 
proportionnal to the identity. 

Proposition 5.2.7. Let s G C°(R d ,R^) and let H : = Id / s 2 . We have for all z eR d 

dil z (lns; d H ) = dil*(s). (5.45) 

Furthermore the following properties are equivalent. 

1. The map s : R d -)■ R* + is Lipschitz. (i.e. H GH,j 

2. ("Additive" Lipschitz property) For all z,z' G R d one has d + (H(z), H(z')) < \z — z'\. 

3. ("Multiplicative" Lipschitz property) For all z^z' G R d one has d x (H(z), H(z')) < 
d H (z,z'). 



Proof: Combining (5.35) and (5.39) we obtain for any z G K 

dil z (lns; dn) = s(z) dil 2 (ln s) = dil 2 (s), 



which establishes (5.45). According to (5.24) we have for any z, z' G Mr 

d + (H(z),H(z')) = \s(z)-s(z')\ and d x (H(z), H(z')) = | \ns(z) - \ns(z')\. 
The properties 1. and 2. are thus equivalent to dil(s) < 1, and therefore also to : 

dil z (s) < 1 for all z G R d , (5.46) 



according Proposition 5.2.5 applied to the constant metric H = Id and to the function 
/ = s. On the other hand property 3. is equivalent to dil(lns; dn) < 1, and thus to : 

dil 2 (lns; d H ) < 1 for all z G R d , 

according to Proposition 5.2.5| applied to the metric H and to the function / = In s. 
This property equivalent to (5.46) according to (5.45), which concludes the proof of this 
Proposition. o 



The next corollary uses Proposition 5.2.7 to analyse the regularity of the first and last 
eigenvalue of a metric. 



Corollary 5.2.8. 1. For any H G H g , the map z 1— > \\H(z) 
2. For any H G H a; the map z 1— > \\H(z)~^\\ is Lipschitz. 



is Lipschitz. 
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Proof: We first establish Point 1. We consider if G H 9 , and we define s{z) := \\H(z)\\~^ 
and 

H\z) :=^L = \\H(z)\\Id, 

for each z G R d . Since if (z) < H'(z) for all z G R d , we have d H (z,z') < d H ,(z,z') for all 
z } z' G R d . It follows that, using (|5.22b, 



\ns(z) -\ns(z')\ = -\\n\\H(z)\\ -\n\\H(z') 

< d x (H(z),H(z')) 

< d H (z,z') 

< d w (z,z'). 



Applying Proposition 5.2.7 to the metric if' we obtain that if' G H i; and therefore that 



s is Lipschitz as announced. 

The proof of Point 2. is more straightforward, since for any metric if G H a we have 



lifter*!! - HffteV* 



< WHlzY* -Fte')"*ll < U- z'\ 



for all z, z' G M d , which concludes the proof of this corollary. o 

Our last proposition studies the local dilatation of the maximum and the minimum of 
two functions. 

Proposition 5.2.9. Let (X,dx) be a metric space and let A,/i G C°(X, M). Let 7 : = 
min{A, /i} and T := max{A, u.}. Then for all x G X 

max{dil a! (7) J dil x (r)} < max{dil se (A), dil* (//)}. (5.47) 

If (X,d x ) = (R d ,d H ), where if G H is a riemannian metric, then the above inequality is 
an equality 

max{dil x (7; d H ), di\ x (T; d H )} = max{dil x (A; d H ), dil^/x; d H )}. (5.48) 

Proof: We have for any p, q G X, 

\j(p)-j(q)\ = |min{A(p),/i(p)}-min{A(g),/i(g)}| 
< max{|A(p) - X(q)\, |/x(p) - fi(q)\}. 

For any x G X and any e > we thus obtain 

1-u \ It(p)- 7(9)1 

dll(7|B(a;,e)) = SU P - 

p,q£B(x,e) a X{P,q) 

< max{|A(p) - A(g)|, |/i(p) - fj.{q)\} 

~ p,q&B(x,e) d X (p,q) 

j \x(p)-\(g)\ Hp) -Ml 

= max < sup — — — r — , sup — — — r — 

[p,q£B{x,e) «X (P, <?) p,qeB(x,e) «X (P, <?) 

= maxIdi^AiB^^dil^iB^))} . 
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Letting e — > we obtain dil x .( 7 ) < max{diLj(A), di\ x ([i)}. Proceeding likewise for T we 



conclude the proof of (5.47). 



We now turn to the proof of (5.48). For that purpose we consider a fixed e > and 
we define 

K := max{dil(7| jB ( :!; , e ); || • \\h(x)), ^(X\B{x, £ y, \\ • ||ff(x))}, 

where B(x,e) stands for the euclidean unit ball of radius e centered at x. Consider two 
points p,q G B(x,e). If X(p) > fi(p) and X(q) > p,(q), or if X(p) < p,(p) and X(q) < fi(q), 
then 

\X(p)-X(q)\ = max{| 7 (p)-7(9)Ur(p)-r(g)|} 
< K \\p - q\\ H ( x ) 

Otherwise if X(p) > fi(p) and X(q) < fi(q), or X(p) < fi(p) and X(q) > p,(q), then there 
exists a point r on the segment \p,q] such that A(r) = /i(r). We thus have 

|A(p)-A(g)| < max{|7(p)-7(r)|,|r(p)-r(r)|} 
+ max{| 7 (r) - 7 (g)|, \T(r) - T(q)\} 
< K\\p -r\\ H{x) + K\\r - q\\ H (x) = K\\p - q\\ H (x) 

If follows that di\(X\B( x ,e)', || • \\h{x)) < K, and letting e — > we obtain 

dil^A; || ■ \\h(x)) < max{dil a; (7; || ■ \\h(x)), di\ x (T; \\ ■ \\h( x ))}- 

Recalling that di\ x (f; d H ) = dil x (f; \\ ■ \\h(x)) for any / G C°(lR d ,lR) and any x G R d , see 
(5.35), we obtain diLj(A; dn) < max{dil a; (7; dn), dil x (r; du)}- Proceeding likewise for \x 
we obtain (5.48) which concludes the proof of this proposition. o 



5.2.3 Geometric properties of the space (M d ,<i#) 

We focus in this subsection on the "geometrical properties" of the space lR d equipped 
with the distance dn and the measure y/ det H(z)dz associated to a graded metric H G H g . 
The next proposition compares the riemannian distance dn between two points z and z+u 
with the norm ||tt||#( z ) of their difference. 

Proposition 5.2.10. Let H G H 9 and let z G M d . For all u G W 1 one has 

ln(l + ||-u||fj-( z )) < d H (z, z + u) < - ln(l - \\u\\ H{z) ), (5.49) 
where the right hand side equals oo by convention when \\u\\h( z ) > 1- 
Proof: We consider a path 7 G C 1 ([0, and we define for each t G [0, 1] 

Kt) ■= [ h'(t)\\ H{7m dt. (5.50) 
^0 
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Since H G H 9 we have for any t G [0, 1] 

exp(-2d H (z^(t)))H(z) <if( 7 (t)) <zx V {2d H {z :1 {t))) H{z). 
Therefore, since dn(z,y(t)) < l(t), 

exp(-2l(t))H(z) < H{y(t)) < exp(2l(t))H(z). 



It follows that 

hence 
We have 



{z) exp(-l(t)) < I' it) = \W(t)\\ H{l[t)) < \W{t)\\ Hit) exp(l{t)), 



-^exp(-/(t)) < h'(t)\\ H(z) < ^exp(Z(t)). 



\u\\h(z) = ||7(1) -7(0)llff(*) 



< 



17 



H(z) 



dt 



(5.51) 



(5.52) 



with equality if 7(i) = z + tu for all t e [0, 1], i.e. 7 is a straight line. For that specific 
path 7 we obtain 



u 



H(z) 



7 



dt 



exp(-/(t)) dt = 1 - exp(-Z H (7)). 



Rearranging the terms we obtain the right part of (5.49) : 

d H (z, z + u) < l H {j) < - ln(l - ||«||if( z )). 

We now consider an arbitrary path 7 joining z and z + u and we obtain integrating the 
right part of (5.51) and recalling (5.52) 

-1 



H(z) 



< 



n 







H(z) 



dt < 



dt 



exp(/(t)) dt = exp(Z H (7)) 



which is equivalent to Ih{i) > l n (l + || M l|_H"(z))- Taking the infimum among all paths 
7 G C 1 ([0, l],M d ) satisfying 7(0) = z and 7(1) = z + u, we obtain the left part of (5.49), 
which concludes the proof of this proposition. o 



We discuss in the rest of this section the consequences of the comparison (5.49) of the 
riemannian distance d# with the distance associated to the norm || • \\h(z) when H G H g . 
Combining the right part of (5.49) with the expression (5.20) of the distance d x we obtain 
an estimate of the local variations of the norm || ■ \\h(z) associated to the metric if at a 
point z : let H G H s , and let z,u,v G M. d be such that < 1, then 

(1 - |M|//( 2 ))|M| J F/(Y) < < (1 - ||M||//( 2 )) _1 |kl|//(z)- (5.53) 

Note also that 

(1 - \\u\\ H{z) ) d y/detH(z) < ^detH(z + u) < (1 - \\u\\ H{z) )- d y/detH(z). (5.54) 
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Another important consequence of (5.49) is that for any z G M. d and any r > the closed 
ball {z' G M. d ; dn(z,z') < r} for the distance djj, is a closed and bounded subset M d , 
hence is compact. 

The following example establishes that (5.49) is a sharp inequality. The metric H 
defined by 

H(z) := (TTRF 

belongs to H« since z >->■ 1 + \z\ is Lipschitz, hence H also belongs to H a and H 9 . For any 
v G M d , one easily checks that the path of minimal length joining to v is the straight 
line, and that 

d H (p,v)=1n(l + \v\). 
Choosing z = and u = v we obtain that the left part of ( |5.49 ) is sharp 

ln(l + ||u||tf(o)) = ln(l + |m|) = d H (0, + u). 

Choosing z = —v and u — v we obtain that the right part of ( 5.49[ ) is sharp 





u 




1 + 


u 



The next corollary uses Proposition |5.2.10 to obtain a lower bound for the mass of 
balls in the measure z >->■ ^det H(z)dz associated to a graded metric H G H 9 . For any 
metric H G H, any z G lR d and any r > we define the ellipse 



B H (z, r) := {z + u; \\u\\ H(z) < r} 



and we observe that 



\B H {z,r)\ = ur d (detH(z))-z 
where u denotes the volume of the standard euclidean ball of radius one. 

Corollary 5.2.11. There exists c = c(d) > such that the following holds. For any 
H G H„, any r > 1 and any z G IR d we have 



L 



Vdet H > clnr. 



(5.55) 



B H (z,r) 



Proof: It follows from Proposition 5.2.10 that dn(zo, z) > ln(r + 1) for all z G 8Bh(z, r). 
We define the integer 

ln(r + l)-i 

and we consider k points z±, ■ ■ • ,z k G E such that d H (z , z { ) = i for all < i < k. We 
define ro := 1 — e~5 = 0.39 • • • in such way that — ln(l — Tq) = 1/2. We have according 
to (5.54) for all z G B H (zi,r ) 

VdetH(z) > (1 - \\z - ZiWH^Yy/detHizi) > (1 - r )Vdet # (5.56) 
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For any z G B^iz^r^) we have 

d H (z , z) < d H (z , Zi) + d H {z h z) <i- ln(l - r ) < k + ^ < ln(r + 1), 

which implies that B}{{zi,ro) C Bn{z,r) for all < i < k. Furthermore for any < i < 

j < k, z G B H (zi,r ) and z' G B H (zj,r ) we have 

d H {z,z') > d H {zi,Zj) - d H (zi,z) - d H {zj,z') > \d H (z ,Zi) - d(z ,Zj)\ + 21n(l - r ) = 0, 
hence Bn(zi, r ) fl Bn(zj, r ) = 0. It follows that 



f VdetH > I VdetH 

JB H (z,r) ()<i<k J B H{zi,r ) 



0<i<k jB H( z ^ r o) 

> J2 ('i--ro) d V^H(z i )\B H (z i ,r )\ 



0<i<k 



= ur d (l-r ) d (k + l) 
> co(ln(r + l)-l/2), 

where Cq := utq(1 — r ) d . Since r > 1 we have 

ln(r + !)--> ( l \ + 1 ^ - - ) + (1 1 ln(r + 1) > ( 1 ] lnr 

V ; 2~V 21n2 2y V 21n2 / V 21n2 

which concludes the proof with c = (1 — 1/(2 In 2))cq. 



5.3 Metrics having an eigenspace of dimension d — 1 
at each point 

We focus in this section on metrics H G H such that the symmetric matrix H(z) has 
an eigenspace of dimension at least d — 1 at each point 2 G lR d . Note that this condition 
clearly holds if the dimension is d — 2, but is also relevant in some applications to higher 



dimension as illustrated in ^(^3 of the next chapter. The main result of this section is 
Theorem 5.3.1 which characterises those of these metrics which belong to H a or H g in 
terms of the regularity of their eigenvalues and of their eigenvectors. 

We denote by § := {9 G M. d ; \9\ = 1} the euclidean unit sphere of M. d , equipped with 
the distance 

ds(M') : = arccos((0,0'))- 
We denote by A the space of parameters 

^:=k;xk;x s, 

and we define a map S : A — > as follows 

S(X, fi, 6) := Xd6 T + /i(Id -00 T ). (5.57) 
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The matrix of S, in any orthonormal basis B of R d which begins with the vector 9, has 
the following form 

/ A ••• \ 
n ••• 

fx 

; o '•. 



\ 



J 



We also define for any a = (A, /i, 9) G A 

H{a) := S(a)- 2 = X~ 2 99 T + //~ 2 (Id -66 T ). 

For any metric H G H and any z 6 we denote by dil z (H) x and dil z (i/) + the local 
dilatations at z associated to the Lipschitz conditions (5.25) and (5.26) defining H g and 
H a : 

di\ z (H) x := dil,(F; d x ,d H ) and dH z (H) + := di\ z (H; d + ). (5.58) 

Let Vl C M. d be an open set, let a G C°(Q,A), a(z) = (\(z), fi(z),9(z)), and let z G O be 
such that A(z) ^ li{z) or dil 2 (6 l ) < oo. We define 



D z (a) x := max <^ dil 2 (lnA; d ff ), dil z (ln//; d H ), 



X(z) fi(z) 



fi(z) \{z) 



d\U6\ d 



H, 



and 



DJa) 



max{dil 2 (A), dil 2 (/x), \\(z)-ii{z)\til z (e)} 



(5.59) 



(5.60) 



Note that these quantities are not defined if X(z) = fi(z) and di\ z (9; du) = oo simulta- 
neously, since an indeterminate product x oo appears in ( |5.59 ) and (5.60). 

Theorem 5.3.1. Let H G H. Assume that there exists an open set Q C Mr and a 

continuous function a G C°(Q,A), a(z) = (X(z), fj l (z),9(z)), such that H = % o a on Q. 
For each z G £1 such that X(z) ^ fi(z) or dil z (0) < oo one has 



D z (a) x < di\ z (H) x < 2D z {a) x 
D z {a) + <di\ z {H) + < 2D z (a) + . 



(5.61) 
(5.62) 



The rest of this section is devoted to the proof of this theorem and to Corollaries |5.3.5| 
and 5.3.6 Our first intermediate result in the proof of Theorem 5.3.1 defines and estimates 
a quantity A (a, 6, c) which appears repeatedly in the rest of the proof. 

Lemma 5.3.2. For each a,b,c 6l we define 



A(a, b, c) 



/ a b ••• \ 

b c •• 

c 

; ; o 



■ / 



(5.63) 



Then 



max{|a|, \c\} < A(a, b, c) < 2 max{|a|, |6|, |c|}. 



(5.64) 
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Proof: We consider fixed values of a, b, c G R and we denote by M the matrix appearing 



in (5.63), in such way that A(a, b, c) = \\M\\. We also define A := max{|a|, \b\, \c\}. We 
have 

A(a,6,c) = ||M|| > max |M«| = A, 

l<i,j<d 



which establishes the left part of ( |5.64 ). For all p G [1, oo] we denote by V the usual norm 
on M d of exponent p. We have 

||M||,oo ,,cx, = max IMJ < 2A and ||M||,i_,,i = max |M W | < 2A. 

Ki<d 1 Kj<d ' 

l<j'<d l<l<<i 

By interpolation we thus obtain 



||M|| = ||M|| M < v/HMII^illMll^oc < 2A, 
which concludes the proof. o 

The parameter space i = §x x is a differential variety, and the regularity of 
functions defined on A should be seen through local charts. For each a = (A, /z, 9) G A we 
introduce a local chart ip a of a neigborhood of a in .A. 

V> a : (-A, oo) x (-//, oo) x (5(0, 7T) n 9 X ) -± A, (5.65) 

where 9 1 - := {6 G M d ; 9) = 0} denotes the space orthogonal to 9, and 5(0, 7r) the 
open euclidean ball of radius 7T. We define 

V>«(A, M, r9) := (A + A, /i + M, cos(r)# + sin(r)0), 

where |6| = 1. Note that ip a is one to one and that d§(6, cos(r)9 + sin(r)O) = r. 

Let V be a banach space. We say that a function <p : ^4 — > V is C 1 if for any oGi 
the function (p o ip a is C 1 . In that case for any a & A we define the differential 

d ^(A) : R x R x 9 X -»• V 

by the formula 

4^) := d ( V o « W = lim *■<■»■(")-*■(■») , 

Proposition 5.3.3. Define V + := Sd, equipped with the standard norm || • || ; and <£>+ := 5 

on A. Then for all a,b G A we have 

d + (H(a),H(b)) = y + (a)-<p + (b)\\. (5.66) 

Furthermore (p + G C 1 ^, V+) and /or a// a = (A, 6 1 ) G A and a// A = (A, M, r9) G T a .A, 
wzt/i |9| = 1, we have 

IK^+^IU = A(A, (A-/i)r, M). 



5.3. Metrics having an eigenspace of dimension d — 1 at each point 



229 



Proof: The identity (5.66) directly follows from the fact that T-L{a)~2 = S(a) = </?+(a), 
and from the definition (5.23) of d + . The function ip + = S is C 1 (in fact C°°), since it has 
a polynomial expression 

s(\,u.,e) -.= xee T + fi(i-ee T ). 

We consider a fixed point a = (X,/i,9) G ^4, and A = (A, M, rO) G T a ^4, where |0| = 1. 
We thus have for t G M 

5(^a(M)) = (A + tA)(# + tre)(# + tr9) T 

+ (/i + tM)(Id-(0 + trQ){9 + tr6) T ) + £>(t 2 ) 
= S(X, n, 9) + t{A99 T + M(Id -^ T ) + r(A - //)(0e T + 60 T )) + 0(f) 



Therefore 



d a S{A) = A99 T + Af (Id -99 T ) + r(A - fi)(9Q T + Q9 1 



(5.67) 



Choosing an orthonormal basis B of W 1 which begins with the unit orthogonal vectors 9 
and 9, we obtain 



[d a S(A)] B 



( A r(X-n) ■■■ \ 
r(X-fj) M ■■■ 
MO 



V 







which concludes the proof of this proposition. 



o 



Proposition 5.3.4. Define V x := C°(S,R), equipped with the \\ ■ ||oo norm, and define 
ip x : A — > V x as follows : for any a G A 

ip x {a) := (u H> In ||u||«( a ))- 

Then for all a,b G A we have 

\\<Px(a) 



X \"l 00 



d x {H(a),H(b)). 



(5.68) 



Furthermore <p x G C 1 (^4., V x ) and for all a = (A, u.,9) G A and all A = (A, M, rO) G T a .A, 
twt/i |G| = 1, we have 



||d Vx(^)IU = A 



A r /A /A M\ 

a' 2 V/I ~ a; ' 7*"/ 



(5.69) 



Proof: The expression (5.68) immediately follows from the expression (5.20) of the dis- 
tance d x . It is well known that the inverse map Inv : GL^ — > GL^ is C 1 and has the the 
following differential : for any (j) G GL^ and any $ G Mj 



ci Inv($) = 
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Since S : A — > ST is C 1 , as observed in the proof of Proposition 5.3.3, the composition 



Inv o S : A — > is also C 1 , and for any a = (A, u.,6) G A and any A G E x E x # J 

tZ a (Invo5)(A) = 5(a)- 1 (4S(i))5(a)- 1 . 

For any u G § we have 

In \\u\\ H(a) = In ^(a)-^!! = i In ((5(o)" V Sfa)-^)) . 
Therefore, again by composition, 

d (ln|M|«)(A) 



2|l«S(a)-%|P ((<S(arl(4<S(A))5 ' 1(a) ^' 5 ~ 1(a)M) 
+ (5- 1 (a)w, «S(a)- 1 (4«5(A))5- 1 (a)«}) 



where t> = 5 1 (a) , u. The function G has the following expression : for any a = (A, /i, 9) G 
R;xR* + x§ and any A = (A, M, r0) G E x E x 0- 1 , where |6| = 1, 



G(a, A) 



Siay^daSiA)) + (d a S(A))S(a) 



-l 



±ee T + -(id -^ T ) + -(--^) (#e T + 

A jj, 2 \/x A/ 



where we used the explicit expression (5.67) of d a S(A), and the fact that 9 = 



Choosing an orthonormal basis B of M. d which begins with the unit orthogonal vectors 9 
and 6, we obtain 



/ 



[G(a,A)], 



A 
A 



Z I A _ it 
r ( A _ m A M 

o^O 



o 

••• 



V 



o ••. y 



The map ip x is different iable, as the composition of differentiable maps, and d a (ip x (A)) is 
the element of V x := C°(S>,M) defined by u h-> rf a (ln ||w||^)(A). Therefore 



SUP l IMP 

||G(a,A)||, 



w G S, v = <S(a) 1 m | 



which establishes (5.69) and concludes the proof of this proposition. 



We now consider a fixed and we remark that 

dil z (iJ) x = di\ z (H o a ) x = dil z (y? x o a; d H ). 



5.3. Metrics having an eigenspace of dimension d — 1 at each point 



231 



Therefore, according to Lemma 5.2.4 for any norm || • \\ a t z \ defined on the tangent space 



xRx 0(z) to a(z) G A we have 



dilo(^ + o-0 a(z) ; || • \\ a{z) )di\ z (i) a ^ z) a] d H , || • \\ a (z)) 

< dil z (H) x 

< dil (y?+ o ip a ( z y, || ■ || aW ) dil^O^) o a; d H , || • || (*)) 



(5.70) 



provided no indeterminate x oo or oo x appears on the first and last term of this 
inequality. 

We first consider the case where X(z) ^ n{z). For each a' = (A', //, 6') G A we define a 
norm || ■ || a / on the tangent space RxMx 6' L to A at the point a 1 : for all A = (A, M, 0) G 

R x E x e r± 



max 



|A| |M| 1 
A' ' /i' ' 2 



// A 7 



101 



We have by construction 



dil 2 (^ a( 1 z) o a; d H 



> II ' 111(2)/ 



DJa) 



(5.71) 



According to Lemma 5.3.2 and Proposition 5.3.4 we have for any a' = (A', fjf, 6') G A and 
anyAeRxRx^ 1 



\A\\ a > < \\d a ,<p x (A)\\ c 



\d (<p x o^ a 



< 2|U|L. 



Since ip x is C it follows from (5.38) that 



1 < dilo(y? x o ip a ,] || ■ || a /) < dil (y? x ip a >] 



,A < 2. 



Combining (5.70), (5.71) and the last inequality we obtain 

D z {a) x <di\ z (H) x <2D z (a) x , 



which establishes the announced result (5.61) in the case X(z) 7^ l^(z). A similar reasoning 
establishes the counterpart (5.62) of this inequality for the distance d + . 

We now consider the case X(z) = f-i(z). For any e > and any a' = (X',^',9') G A 
such that A' = // we define a norm || • || a / jE on the tangent space KxRx # /_L to a' in A 
as follows : 

'|A| |M| 



a,e 



max 



e e 



A' ' n' 

This modification is required because the original norm || • || a / used in the case A' 7^ u! is 
only a semi-norm when A' = //. Reasoning similarly to the case X(z) 7^ fi(z) we obtain 
the upper bound 

di\ z (H) x < 2max{diL(ln A; d H ), dil 2 (ln/i; d H ), edi\ z (9; d H )} . 

Since X(z) = fi(z), the assumptions of the theorem state that di\ z (9; do) < 00. Since 
e > is arbitrary we thus obtain 



di\ z (H) x < 2max{diL(lnA; d H ), dil a (In/x; d H )} = 2D z (a) x . 
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We now turn to the lower bound on di\ z (H) x . For that purpose we define the functions 
7 := min{A,/i} and T := max{A,/i}. For all z, z' G Q we have according to (5.22) 

d x (H(z),H(z')) > max{|ln 7 (z) -lnj(z')\, \\nT(z) - \nT(z')\}. 

Therefore 

dil z (H) x = di\ z (H; d x ,d H ) > max{dil 2 (ln7; d H ), dil 2 (lnT; d H )}- 



Hence according to Proposition 5.2.9 



dil z (#) x > max{dil z (ln A; d H ), dil^ln^; d H )} = D z (o) x . 
We have thus obtained D z (a) x < dil 2 (if) x < 2D z (a) x . Proceeding likewise we obtain 



D z (a) + < di\ z (H)+ < 2D z (a) + , which concludes the proof of Theorem 5.3.1 
Our first corollary compares D z (a) x with the dilatation di\ z (8). 



Corollary 5.3.5. Under the hypotheses of Theorem 5.3.1 we have 

\\(z) - u.(z)\dil z (9) <2D z (a) x 



Proof: We have according to (5.35) 



\\(z)-U.(z)\dZ z (0) < '^rt W } L dil 

max{A(z), n(z)} 



< 



mm{X(z) , fi(z)} 
X(z) fi(z) 



1 dilM d 



H, 



fi(z) X(z) 
< 2DJa) v 



dil 



which concludes the proof. 



The next corollary shows how to construct a quasi-acute metric from a graded one. 
Let M e Sd and let U G Od and Ai, • • • , A^ G R be such that 



M = U T diag(Ai 



(5.72) 



where diag(Ai, • • • , Xd) denotes the diagonal matrix of entries Ai, • • • , A^. For any dGl 
we define the matrix max{a, M} G Sd as follows 

max{a, M} := U T diag(max{a, Ai}, • • • , max{a, Xd})U, (5.73) 

and we observe that maxjo, M} does not depend on the choice of the matrix U and on the 



order of the eigenvalues Ai 
and any s > one has 



• , Xd in the decomposition (5.72) of M. For any (A, /i, 6) G A 
max{s- 2 , n(a)} = min{s, A}~ 2 ## T + min{s, /i} _2 (Id -66^). (5.74) 
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Corollary 5.3.6. Let H G H be such that H(z) has an eigenvalue of multiplicity at least 



d — 1 for all z G Mr. We define s : 



and H G H as follows : for all z G M. d 



s\z 



inf \z - z'\ + \\H(z'y2\\, 



H'{z) := m&x{s(z)- 2 ,H(z)}. 
If He H 9 , then 4?H' G H a . 

Proof: We first observe that s : M. d — > M* + is Lipschitz and that s(z) < \\H(z)~z\\ for all 
z G IR d . For each e > and each z G M. d we define 



He(z) 



ma^{e 2£ s(z)- 2 ,H(z)}. 
z) is pr< 

H(z)-%\\ > s(z), then H £ = e 2e ld/s 2 on a neighborhood of z. This implies 
dil 2 (# £ ) + = di\ z (e- E s) < e~ e < 1. 



Consider a fixed point z G M d . If is proportional to Id, which means that 

\\H(z)\\-i 
according to (5.45) 



di\ z (H £ ) y 

On the other hand if H(z) is not proportional to Id then there exists a G C°(R d , A), 
a(z) = (X(z), /j i (z),8(z)) such that H = H o a on a neighborhood of z, and D z (a) x < 1 
according to Theorem 5.3.1 We have according to (5.74) on the same neighborhood of z, 

H £ = Hoa £ = min{e- £ s, A}~ 2 ## T + min{ e - £ s, /i}' 2 (Id -99^), 

where a £ = (min{e~ e s, A}, min{e~ e s, /i}, 9) G C°(IR d , ^4). 

We define H\ := e 2s Id / s 2 and we observe that H £ > H\ and H £ > H, hence for any 
p, q G R d 

d He (p,q) > max{d H (p,q), dm(p,q)}. 



It follows from Proposition 5.2.5 that 

dil z (lnmin{e~ £ s, A}; dn E ) < max{dil z (ln(e _£ s); d H J, dil 2 (In A; dn e )} 

< max{dil z (ln(e _£ s); d H i), dil z (lnA; dn)} 

< max{dil 2 (e~ e s), dil z (H) x } 

< max{e~ £ , 1} = 1, 



where we used Proposition 5.2.7 and Theorem 5.3.1 in the third line. Proceeding likewise 
for the other eigenvalue of H £ we conclude that 

dil 2 (lnmin{e~ e s, A}; dnj < 1 and dil 2 (lnmin{e~ e s, /i}; dn s ) < 1. (5.75) 

One easily checks that 

min{e~ e s, A} min{e~ e s,/i} 



min{e~ £ s,/i} min{e~ £ s, A} 



< 


A 


/' 






A 
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using that t \— > t/[i — ///Ms increasing on IR1. Therefore 



min{e £ s, A} min{e £ s,fi} 



min{e~ £ s, /i} min{e~ £ s, A} 



dil*(0; d H J < 



A /i 
// A 



diU(0; < 1. 



Combining the last estimate with ( |5.75[) we conclude that D z (a £ ) x < 1 and therefore 



di\ z (H £ ) x < 2 according to Theorem 5.3.1 



Our next objective is to estimate di\ z (H s ) + . Let us assume that X(z) < fx(z), then 

dil z (min{e~ e s, A}) < max [dil z (e~ £ s), dil z (A)} 

dil 2 (A; d H ) 



< max < dil,(e £ s) 



max < dil z (e £ s), 



mm{X(z) , fi(z)} 
dil z (A; d H 



X(z) 

< max{e~ e , dil 2 (lnA; du)} 

< max{e~ e , 1} = 1, 



where we used (5.35) in the second line. Still assuming that X(z) < fi(z), we have s(z) < 
\\H(z)~2\\ = fi(z) by construction of s, which implies that min{e _e s,/i} = e~ £ s on a 
neighborhood of z, and therefore dil z (min{e~ £ s, //}) = dil 2 (e _e s) < e~ £ < 1. Proceeding 
similarly if X(z) > fi(z) we also obtain 



dil 2 (min{e £ s, A}) < 1 and dil z (min{e £ s,fi}) < 1. 



(5.76) 



We now focus on the local dilatation of the orientation 6, and for that purpose we observe 
that 

\ mm{e~ £ s(z), X(z)} — min{e~ £ s(z), fi(z)}\ < \X(z) — fi(z)\. 
Therefore according to Corollary |5.3.5| 

\mm{e- £ s{z),X{z)} -min{e' £ s{z),fi{z)}\dil z (e) < 2D z (a) x < 2 

Combining this with (5.76) we obtain D z (a £ ) + < 2 and therefore di\ z (H e ) + < 4. 

The above arguments show that di\ z (H £ ) x < 2 and di\ z (H £ ) + < 4, which implies that 
2 2 H £ G Hg and 4 2 H £ G H a according to Proposition 5.2.5 and Remark 5.1.13 Hence for 
any z,z' G E d , since H £ < e 2e H', 

d x (H £ (z) } H £ (z')) < 2d He (z,z') < 2e £ d H/ (z,z') and d + (H £ (z), H £ (z')) < 4\z - z'\. 

Letting e — > we obtain 

d x (H'(z),H'(z')) < 2d H ,(z,z) and d+(H'(z), H'(z')) < 4\z - z'\. 



Therefore 4 2 H G H a according to Remark 5.1.13 which concludes the proof of this corol- 
lary, o 
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5.4 From mesh to metric 

The techniques presented in this section show how to produce an metric ff £ Hj, H a 
or H g from an isotropic, quasi-acute (in two dimensions), or graded mesh T. These results 



are summarized in Proposition 5.4. 1| below. 



Combining this proposition with its counterpart Proposition 5.5.1| in the next section 



on the generation of a mesh from a metric, yields the main result of this chapter, Theorem 



5.1.14, which states the equivalence of the classes T i:C C T a C C T g ,c of meshes and 
H.j C H a C Hg of metrics. 

Proposition 5.4.1. For any Cq > there exits C = C(Co,d) such that the following 
holds : 

i) For any T G Ti } c there exists H G Hj which is C -adapted to T ■ 

ii) If d = 2, then for any T G T a ,c there exists H G H a which is C -adapted to T ■ 
Hi) For any T G T g ,c there exists H G H 9 which is C -adapted to T ■ 

The rest of this section is devoted to the proof of this proposition. For any mesh T G T, 
and for any vertex v of T, we denote by n-y(v) the number of simplices in the mesh T 
containing v : 

nr{v) := #{TgT; veT}. 

We denote by Hj- G H the piecewise linear metric on the mesh T which satisfies for each 
vertex v of T 

' v ' TeT 
s.t. ueT 

The next lemma establishes a property of the barycentric coordinates on a simplex, 
which is useful for the analysis of the regularity of a piecewise linear function on an 
anisotropic mesh, such as the metric H^-. 

Lemma 5.4.2. Let T be a simplex and let V be the collection of vertices ofT. Let (A„)„ g y 
be the barycentric coordinates on T. Then for all z,z' G T 



/m2 _ d + \ 
~ lnT ~ d ( 



I--- --'111,. ---- ^— ^2\X v {z) - \v{z')\ 2 . (5.78) 



Proof: We first assume that T is the reference equilateral simplex T eq . For any two vertices 
v, v' in the collection V eq of vertices of T eq one has 

<«■"'>={ -In Vv7*. (5 - 79) 

Indeed \v \ = 1 by construction for any vertex v of T eq , and for any v' G V eq distinct from 
v the scalar product (v,v') has a value a independent of v and v' by symmetry. Since T eq 
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is centered at the origin we obtain = (v, Ylv'ev v ') 
We thus obtain for any z, z' G T oq 



da which establishes (5.79) 



\z — z 



'|2 



2j {Xv{z) - \v{z'))v 

\K(z) - K(z')\ 2 - I ^(^(z) - A„(z'))(A^(^) - 



Therefore 



L2 — -2 



/|2 



(i + i) £ \M 



z) - \ v (z 



t\\2 



z) - A„(z'l 



which concludes the proof in the case of T eq since "Hy = Id and since the barycentric 
coordinates of z and of z' sum to 1. 

We now consider an arbitrary simplex T and an affine change change of coordinates 
$ such that $(T) = T cq , $(2) = (f)z + z where G GL d and z G lR d . We have "H T = T </> 



according to (5.5), hence for all z, z' G T 

mz)-Mz')\ = \ 



[z — z 



'■ \\ z — z \\Ht- 

Furthermore the barycentric coordinates of z and z' in T are the same as those of $(z) 



and $(z') in $(T) = T eq . This implies (5.78) and concludes the proof of this lemma 



o 



The next proposition immediately implies Point hi) of Proposition 5.4.1 Indeed for 
any mesh T G T g ,c the metric dC^Hj- belongs to H g and is C^y/d equivalent to T. 

Proposition 5.4.3. For any TgT 9 c the metric Hj- is Co-adapted to T, and dC^Hj- G 
H s . 

Proof: We consider a simplex T G T, and we recall that for any neighbor X" of T one has 



Averaging these matrices as in (5.77) we obtain for any vertex v of T 

C 2 U T < H 9 r {v) < ClU T . 



(5.80) 



Averaging with respect to the barycentric coordinates of a point z G T we obtain 

C 2 H T < H 9 T (z) < C 2 H T} (5.81) 
which establishes as announced that the metric Hj- is Co-adapted to T . 
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We now establish the regularity property of H!j-, and for that purpose we consider two 
points z,z' G T and a vector u G W 1 \ {0}. We thus have 



\ U Wh 3 t {z>) 



Evev X v( z )W u W 2 HUv 



J2vev X v(z')\\u\\ 2 H , {v) 



1 + 



< i + c n 4 



u 



E, 6 y ^OIMI^) 
E^ey A„(z') 



< 1 + C 4 /(d+l)^|A v (^)-A„(z 



>\\2 



i + c Vd|| 



uey 



z-z \ nq 



5.4.2 



where we used (|5.80|) in the third line, the Cauchy-Schwartz inequality in the fourth line, 
and Lemma 



in the fifth line. Proceeding similarly for ||u||#s (^/HwIIh 9 (z) we obtain 



d x {H g T {z),H 9 T {z)) = max 



T (z>), 

Ml u ILfff(*) -ln||w|| H s (z0 



< i]n(l + C*Vd\\z-z'\\ HT ) 



< 



\z-z 



Hence according to (5.81) and (5.35) for all z G int(T) 

diL;(#f) x = dil,(#f; d x , || • || HW ) < C di\ z (H°-; d x , || • ||« T ) < C 5 Vd/2. 



If follows from Corollary 5.2.6 that dil z (Hj-) x < C\\fdj'l for all z G M. d , and from Remark 



5.1.13 that (dCy /4)H G H 5 , which concludes the proof. 



o 



We associate to any mesh T G T a metric FL % r defined as follows : for all z G M. d , 



W T {z) 



\H 9 r {z)\\ld. 



The next result immediately implies Point i) of Proposition 5.4.1 Indeed for any mesh 
T G Tj c the metric dC^Hlj- belongs to Hj and is C^y/d equivalent to the mesh T . 

Corollary 5.4.4. For any mesh T G T$ c the metric H % r is C^-adapted to the mesh T , 
anddCl Q H l T eHi. 



Proof: We have for any simplex T G T 



<V ||ft T ||Icl < 



\H 



T 



Id= H^II^Id < H T < ||H T ||Id 



(5.82) 
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For any point z G T we obtain since C 2 H^-(z) < Ht < CqHj-(z) 

*-2|| it 9 TJ / ||o/_ll M ^ 2 II W9 



Co^Hflf (z)|| Id < \\Ht\\ W < Collar (*) 



ClH\{z). 



Combining this with ( |5.82 ) we obtain 



C^H^(z) <H T < ClW T {z) 



which establishes as announced that the metric Hj- is Cq equivalent to the mesh T . 

According to Proposition 5.4.3 we have dC^HS- e H s . Corollary 5.2.8 thus implies 
that dC^H % r G Hj, which concludes the proof of this proposition. o 

Before turning to the case of quasi-acute metrics we prove a lemma on the neighbo- 
rhood of a simplex in a graded mesh. For any mesh T G T and any closed set E C M d 
the neighborhood Vj-(E) of E in T is defined as the union of all simplices T G T which 
intersect E : 

V T (E):= |J T. 
Ter 

Proposition 5.4.5. Lei T G T 9jCo and Zei T G T. For all x G T and a// y G JR d \ V T (T) 
we have 

\\x - vWht > (CoVdy 1 . 



Proof: We consider the function if : 



which is piecewise linear on the mesh T, 



and which satisfies for any vertex v of the mesh T 

(p(v) = 



1 if v G T, 
if v £ T. 

We denote by V the collection of vertices of a simplex T" G T, and by (A„(2;))„ 6 y the 
barycentric coordinates of a point z G T'. We have for any z,z' £ T' 



\(p(z) - ip{z')\ < ^2 \X v (z) - X v (z')\tp(v) 



= Vd\\z - z'\\ n 



A,(z')l 2 



Since ip is identically on T" if T fl T" = 0, and since "Hjv < C^Ht otherwise, we obtain 
for any z,z' G X" 

\(p(z) - (p{z')\ < C Vd\\z - z'\\ Ht - 
Therefore dil z (</?, || • \\n T ) f° r an y T' E T and any z G T", w hich implies that </? : 



is C*o"\/rf-Lipschitz according to Corollary 



y G E d \ Vr(X) we have <^(x) = 1 and <p(y) = 0, thus 1 
which concludes the proof of this proposition. 



5.2.6 



For any x G T and any 
(x) - <p(y) < C - v\\h t 



5.4. From mesh to metric 



239 



We assume from this point that the dimension is d = 2. For each triangulation T G T 



we define a function s T : 



and a metric H9j- e H as follows : for all zeR 2 



s r (z) 



inf + II HUz'^ 



TV 



(5.83) 
(5.84) 



where the maximum of a real and of a symmetric matrix is defined at (5.73). The next 
proposition immediately implies Point ii) of Proposition 5.4.1 Indeed the metric 32Cq°H^- 
belongs to H a and is \/32CCq adapted to the mesh T ■ 

Proposition 5.4.6. For any Cq > 1 there exists C = C{Cq) such that the following holds. 
For any mesh T G T a ,c the metric H^- is C- adapted to the mesh T , and 32Cq°H^- G H a . 

The fact that 32C 10 iJf G H a directly follows from the fact that 2C 10 H^ G H g and 
from Corollary 5.3.6| 

We define dq-(z) := \\Hj-(z)~2 1| for each z G M 2 . We establish below that there exists 
a constant r\ = r)(C ), < r\ < 1, such that for all z, z' G M. 2 

if \z' - z\ < r)d r {z) then d T (z') > rjdr(z) . (5.85) 

Before turning to the proof of this property we show how it leads to the proof of Propo- 
sition |5.4.6l For all z G M. d we have 

Sf{z) = inf \z — z'\ + df(z ) > inf min{|2; — z'\, d-j-(z')} > r]d-f(z'), 



z'eir 



z'eR 2 



indeed \z — z'\ > r/df(z) or dj-(z') > i]d-j-(z) for any z' G M. d according to (5.85). Combining 
this with the definition (5.84) of Hj- we obtain for all zGl 2 

rfH^(z) < E g T {z) < H^(z). 



Recalling that Hj- is C -adapted to the mesh T, see proposition 5.4.3, we obtain for all 
T G T and all z G T 

rfC^Hftz) < C 2 H^z) <H T < C*H°-(z) < C 2 H^z) 

which establishes that the metric H^- is Cq/tj adapted to the mesh T, and concludes the 



proof of Proposition 5.4.6 



For notational simplicity we denote from this point H := Hj- and C\ := \/2Cq, hence 
CfH G Hg and T is C adapted to H. It follows from (5.10) that 



Co- 1 !!^)-^!! < \\H T *\\ < diam(T) < 2||H T 5 || < 2C \\H(z) 
for any T G T and z G T, which implies 

diam(T)/(2C ) < d T {z) < C diam(T). 



(5.86) 
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(0 + 1/ 



CO — M 



CO + W 



(0- u 



CO+M 




co— u 



Figure 5.3 - An edge e and the set 0(e) (left), sequence (z n ) n > (center), diameter of a 
triangle intersecting 0(e) (right). 



The quantity dj-(z) should therefore be heuristically regarded as the diameter of the 



triangle T 6 T containing z. Property (5.85) heuristically states that the diameters are 



constrained to vary in a Lipschitz manner in a quasi-acute triangulation. 



We now turn to the proof of (5.85), and for that purpose we introduce some notations. 
For each u G M 2 , we denote by u 1 - the vector obtained by rotating u by 7T/2 in direct 
trigonometric orientation. For each segment e = [u — u, u + u] C IR 2 we define the diamond 
0(e) of e as follows 



0(e) := {z G 



[z — u,u)\ + C \(z - uo,u L )\ < \u\ 2 }. 



This set is illustrated on Figure 5.3 (left). 

The next lemma considers a triangulation T' on which the measure of sliverness is 
uniformly bounded by Co, and compares the length an edge e of T 7 with the diameter of 
the triangles in T' which intersect 0(e). 

Lemma 5.4.7. Let T 1 G T be a triangulation such that S(T) < Cq for all T G T 1 . Let 

e = [u — u, u> + u] be an edge of T' . Then there is no vertex of T' in the diamond 0(e), 
and for all z e 0(e) and any triangle T G T' containing z, we have 



diam(T) > \u\ — Cq\ (z — oj, u A 



u 



Proof: We denote by u x := (1,0) and u y := (0,1) the canonical basis of 1R 2 , and we 
assume up to a translation and a rotation that u> = and u = u y . We recall that for any 
triangle T with maximal angle 9 one has S(T) = max{l, tan(#/2)}. 

We assume for contradiction that there exists a vertex vo = (xo,yo) G 0(e), and we 
assume without loss of generality that x < 0. We consider a vertex v\ = (x\,yi) of T' 
such that (vi — v , u x ) > and such that the angle 



do ■= <(vi - v , u x ) 



arccos 



(vi - v ,u x ) 
\vi - v \\uJ 



has the smallest possible value. By construction there exists a triangle T G T containing 
Vq and V\ which has an angle larger or equal or equal than 28 at the vertex vq. It follows 
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0(e) 



Figure 5.4 - An edge e, the diamond 0(e) and the rectangle Og involved in Lemma 5.4.8 
(left), the rectangles 0(e) and 0§. 



that tan^o < S(T) < Co, and therefore 

\yi-Vo\ < (xi -x Q ) tan 6» < C (xi - x ) 



Proceeding inductively, as illustrated on Figure 5.3 (center), we define a sequence v n = 
(x n ,y n ), n > 0, of vertices of T satisfying \y n +i — y n \ < Co(x n +i — x n ). Adding these 
inequalities together we obtain 



Vol < C Q (x n - Xq). 



It follows that one of the edges [v n ,v n+ i], n > 0, of the mesh T' intersects the edge e, 
which is a contradiction. As announced the diamond 0(e) therefore does not contain any 
vertex. 

We now consider a point z = (xq, yo) G 0(e) and we assume without loss of generality 
that xo > 0. We consider a triangle T containing z , and an edge e' = [u/ — u' + u'] of 
T which is on the left of z (in the sense that the edge e' and the segment [(0, yo), (xo, yo)] 
joining the point z to its orthogonal projection on the edge e intersect). The point z and 



the triangle T are illustrated on Figure 5.3 (right) 



Since there is no vertex of T' in the diamond 0(e), the edge e' intersects the boundary of 
0(e) at two points. We denote these points by (x,y) and (x',y'), and we assume without 
loss of generality that y > > y' . Observing that < x < Cq 1 , < x' < Cq 1 and 
min{x,x'} < Xq we obtain 

y _ y > = (i _ c x) - (-1 + C x') > 1 - C x . 

Since u = u y = (0, 1), cj — and 2u' = X(x — x',y — y') for some |A| > 1, we obtain 

diam(T) = 2\u'\ > 2\{u,u')\ = y — y' > 1 — CqXq = \u\ 2 — Co\(zq — u,^)], 



which concludes the proof of this proposition. 



o 



From this point we denote by T' a Co-refinement of the quasi-acute mesh T G T O) c , 
which satisfies S(T) < Cq for all T G T'. The next lemma involves a rectangular set 0(e) 
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associated to an edge e of T', illustrated on Figure K4] (right). Note that e C int(D(e)), 
while e (£ 0(e). 



It follows from (5.22) and Proposition 5.2.10 that for all z,u& 

d T (z + u) > (1 - Ci\\u\\ H {z))dr{z)- 



(5.87) 



Lemma 5.4.8. There exists a constant 8 = d~o(C ,d) > such that the following holds. 
For each edge e = [u — u, u> + u] ofT', let D(e) be the rectangle defined by 

□ (e) = { z eM 2 ; \(z-u,u)\ < (1 + 8 )\u\ 2 and \(z-u,u x )\ < 8 Q \u\ 2 }. 

One has dq-(z) > 8q\u\ for all z G D(e). 

Proof: We denote by u x := (1,0) and u y := (0,1) the canonical basis of R 2 . Up to a 
rotation and a translation, we may assume that u = and u = u y = (0, 1). We consider a 
triangle T 6 T containing the edge e (note that e C T but that e may not be an edge of 
T, since e is an edge in the triangulation T' but may not be an edge in the triangulation 
T). It follows from fl5~86| that 

d T (z) > diam(T)/(2C ) > diam(e)/(2C ) = Cq 1 
for all z G T, hence for all z G e. Hence for all z G e and all m G M^, we obtain using 

rfr(^ + w) > (1 -CiNko^Co -1 > (1 -C a Co||u||« T )C^ 1 . 
= 4C Ci and we observe that, since HiiJI^ < 1, any point z' of the 

-1 1 



( p7) 

We define C 2 
rectangle 



a; ma 



C 2 \\u x \ 



Hi 



X 



[-(l+C^ 1 ), 1 + C^ 1 ]. 



(5- 



can be written under the form z' = z + u where z G e and ||m||-h t < l/(2CoCi), and 
therefore d T {z') > 1/(2C ). 

We introduce a small constant 5 = 5(Co) > which is specified later in the proof of 
this lemma. It follows from the above argument that the proof of this lemma is complete 
if H'HtII ^ ^ 2 > with S := min{<5/C 2 , 1/(2C )}. We thus assume from this point that 
\\Ht\\ > 5~ 2 . 

We assume that 5 < C X and we define the rectangle 

n s :=[-8,S\x[l-C 8,-l + C S\, 

which is included in 0(e) by construction. Since C 2 H G H s , the function z h-> \\H(z) 1 1 i 
is Ci-Lipschitz according to Corollary 5.2.8 We thus have for all z = (x, y) G 

11^(2)11-3 < \\H(0,y)\\-* +d\x\ < C \\n T \\-^ +Ci\x\ < r (8) := (Cq + CJS. (5.89) 



On the other hand it follows from Lemma 5.4.7 that any z = (x,y) G is contained in 
a triangle T' such that diam(T') > (1 — Cq8), hence according to (5.86) 

d T (z) > diam(T')/(2Co) > R (8) := (1 - C 8)/(2C ). (5.90) 
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For all z G □,$, we denote by 9(z) = (cos Q(z), sin Q(z)), Q(z) G [0,7T], the direction of 
the eigenvector associated to the small eigenvalue of H(z), in such way that 



l|0(*)ll*M 



\H(z)s\ 



d T (z) 



(5.91) 



According to (5.87) we have for all z G □$ and all r G R 

d r (z + rO{z)) > d r (z)(l - CM/driz)). 
For all zeDj and all r G R such that |r| < i?i(5) := Rq{S)/{2Ci) we thus obtain 

d r (z + r9{z)) > R (5)(l - d|r|/i2o(*)) > Ro(8)/2. (5.92) 
We have for any (0, y) G e 

C > C |Klk T > 1 1 Uj, 1 1 #(0,3,) > |cos0(O,y)|Vll^(O,y)|| > | cos 9(0, y)\/(SC ), 



(5.93) 



hence | cos 0(0, y)\ < Cq5. For all z G Og we have according to Corollary 5.3.5 

(R (S)-r (S))di\ z (e) < (||F(z)-||| - ||#(z)||-i)dil z (e) < 2d. 
Hence we obtain for all z — (x,y) G Dg 

{Ro{8)-r {8))\Q{z)-e(0,y)\ <2C 1 \x\ <2C 1 5, 
hence since t > cost is a Lipschitz function 

| cos B(z) | < cos 00(5) := Cl5 + - 2Cl< * 

Ro{°) - r {S) 

It follows that dj-(z') > Rq(5)/2 for all z' in the set 

{z + f6{z) ; z G |f | < i?i(<5)} D [-<5 , <*o] x [-(1 + /i ), 1 + /i ] 

where 

<5 := 5 — r cos Qq(5) and /i := — C 5 + r sin O (5) (5.94) 

and where the constant < r < R\(S) can be freely chosen. The rest of the proof consists 
in choosing appropriate constants S and r. 

As 5 -)■ we have r (<5) -» 0, i?o(5) ->■ V( 2 Co) and cos6 (<5) ->■ 0. We thus choose 
5 = 5(Co) > sufficiently small in such way that 

Ro > 1/(4C ), cos6 < 1/(8C ), sin6 > 1/2. (5.95) 

and such that r : = 4C <5, is smaller than Ri(6) = Rq(8)/(2Ci). Injecting this choice of r 



and (5.95) in (5.94) we thus obtain 

<5o > 8/2 and /x > Cq& 
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This establishes under the hypothesis ||"Ht|| > S 2 that df(z) > l/(8Co) for all z in the 
rectangle 

[-5/2,5/2] x [-{1 + C 6),1 + C 6\, 
which concludes the proof of the proposition. o 

Our next intermediate result establishes a technical property on the covering of a 
segment by smaller ones. 

Lemma 5.4.9. Let X\, ■ ■ ■ ,Xk £ [0, 1] and let hi, - - • ,h/~ > be such that 

[0,1] C |J [Xi-hi, Xi + hi}. (5.96) 

i<j<fc 

Let 5 > and let h* = h*(k,5) := exp(l — 2k/5)/5. Then 

[0,1] C |J [xi-{l + 5)h h Xi + (1 + S)hi]. (5.97) 

l<i<fc 
s.t. hi>h* 

Proof: For each y £ [0, 1] we denote by h(y) the value hi associated to the segment 
[xi — hi, Xi + hi] of smallest length containing y. We thus have 

dy f x > +h * 1 

Ki<Jfc ,/!B »" 



[0,1] Ml/) iTlluJxi-hi h 



We assume for contradiction that (5.97) does not hold, and we consider a point x £ [0, 1] 
which does not belong to the right hand side of [0, 1]. For each y £ [0, 1] there exists i, 
1 < i < k, h(y) = hi, such that y £ [x^ — hi, Xi + hi] and x [xj — (1 + <5)/ij, Xj + (1 + 
Therefore one of the two following possibilities holds 

5h(y) < \x — y\ or h(y) < /i*. 

Therefore /i(t/) < max{|x — y\/5, h*}, which implies 



/i(y) 7 max{|x /i*} Jo h* J Sht y 

Note that the second inequality is an equality if x — or 1. Thus 2A; > 5(1 — ln(5h*)), 
which contradicts the definition of h*. This concludes the proof of this lemma. o 



Observe that under the hypotheses of Lemma 5.4.9 we have 



[-hj, l + hj]c [J [xi - {1 + 25)hi, Xi + {1 + 25)hi]. (5.98) 



l<i<k 
s.t. hi>h t 



We now conclude the proof of Proposition 5.4.6 We consider a point z £ 1R 2 contained 



in a triangle T E T, and we denote by e the longest edge of T. Up to a rotation and a 
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! e2 i 


23 


e 4 



hi < h, 



□(«*) 



Figure 5.5 - A triangle T e T and the edges (ej)i<j<jt of T' contained in T (left), the 



set (5.99) (right). 



translation, we may assume that e = [0, u x ] joins the origin of 1R 2 to u x := (1, 0). Since the 
triangulation T 7 is a Co-refinement of T, the edge e of T is built of k < Co edges (ei)i<»<fc 

of r, 

e* = [(xj - /ii)^, (xi + hi)u x }. 
The triangle T and the edges (ej)i<i<& are illustrated on Figure 5.5 (left). 



We define h* := h*(Co,5o/2) = 2exp(l — 4Cq/So)/Sq, where Sq is the constant from 



Lemma 5.4.8, and we obtain using (5.98) that 



U Q ( e *) D 



h*5o h*5o 
^ 1 1 ~t~ ^ 



(5.99) 



This set is illustrated on Figure 5J)] (right). Since e is the longest edge of T, the angles of 
T at the extremities of e are acute. Furthermore the height h of T orthogonal to the edge 
e = [0, satisfies 

IT 1 I IT 1 1 1 

I -L eq | I - 1 ea Ml 7T>t I 



V 2 = \T\ 



T, 



cq| 



Vdet U T 



,,,,, ,, T 

eq|||/i- T || |-teq| 



4p(T) ' 



where we used that \\H T 2 1| > diam(T)/2 > 1/2 according to (5.10). Hence we obtain with 



c := |T cq |/8 the inclusion 



Tc [0,1] x i-c/p(T),c/p(T)] (5.100) 
We distinguish two cases depending on the value of p(T). If p{T) > p := 2c/(/i*5o) we 



obtain combining (5.99) and (5.100) that dj- > h*5o on the set 

2 



Z + 



Since d-j-(z) < Codiam(T) = Co this implies (5.85) with rj = h*5o/{2Co). One the other 
hand if p(T) < po, then 

C 2 oPo 



\H(z)p < C \\n T p < PcAU^tI < PoCZWHiz) 



II— 1 



< 



d T (z) 
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Figure 5.6 - A tridimensional triangulation generated by the isotropic refinement proce- 



dure exposed in 77 



which implies for any vector «6l 2 

dr{z + u 



d T {z) 



> I -CM 



H(z) 



> 1 



C*\u\ 
d T (z) ' 



where C* := C\PqCq. We thus obtain (5.85) with the constant rj := 1/(2C*), which 



concludes the proof of Proposition 5.4.6 



5.5 From metric to mesh 

This section is devoted to the proof of the following result, on the generation of a mesh 
from a metric. 

Proposition 5.5.1. There exists Cq = Co{d) such that the following holds : 

i) For any H G Hj there exists T G Tj c which is Co-adapted to H . 

ii) If d = 2, then for any H G H a there exists T G T a which is Co-adapted to T . 
Hi) If d = 2, then for any H G H g there exists T G T g which is Co-adapted to T. 



Point i) is established in £5.5.1, Point Hi) in £5.5.2 and eventually Point ii) in £5.5.3 



Remark 5.5.2. Let * G {i,a,g} be a symbol, and let 1 > c > and C > 1 be numerical 
constants. Assume that for any metric H such that c 2 H G there exists a mesh T G T+c 
which is C adapted to H. Then for any metric H' G there exists a mesh T' G T^c 
which is C / c-adapted to H' . (Indeed, choose a mesh T' which is C -equivalent to the metric 
H'/c 2 , and observe that T' is C j c- equivalent to H' .) 



5.5.1 Isotropic Mesh generation 



The survey paper [77] describes a the construction of a hierarchical family of meshes 
of the rectangular domain [0, l] d . All these meshes are the refinement of a "fundamental" 
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mesh 7o = {T a ; o G S^}, where stands for the collection of permutations of the set 
{1, • • • , d}, and T a for the Kuhn simplex 

T a := {ze[0,l} d ; z a{1) < ■ ■ ■ < z a{d) }. 

Starting from 7o, the algorithm described in [77] produces, guided by the user, a family 
(T n )n>o of conforming meshes of the cube [0, Tf proceeding as follows. Assume that the 
mesh T n has already been generated. 

- (Marking) The user selects a subset M. n C T n . 

- (Refinement) The algorithm generates a 2-refinement T n +i of 7^, 

T n +i ■= Refine(7^, M n ). 

which does not contain any of the marked simplices : T n +i H M. n = 0. 
The elements of the meshes (T n ) n >o have a very specific structure, which main features 
are described below. 



Property 5.5.3. Any simplex T produced by this algorithm satisfies the following pro- 
perties. 

a) (Discrete Volumes) The volume ofT has the form 

o-9(T) 

\T\ = -z-, (5.101) 

where g{T) > is an integer called the "generation" of T . 

b) (Finite number of classes) The simplex 

b(t) 

2V(T-*r), 
belongs to a finite family To of simplices. 

The survey paper [77] also contains a result, Proposition 4.1, which establishes that 
T n +i '■= Refine (T n , M. n ) is a local refinement of T n in the following sense : for all T' G 
Tn \ T n +i there exists T G M. n such that 

g{T)>g{T') and d(T, T') < D 2~ 9 ( T '^ d , (5.102) 

where D = D(d) is a fixed constant and where 

d(T,T') := mm{\z - z'\ ; z G T, z' G T'}. 

The next lemma shows that this refinement algorithm can be used to produce a confor- 
ming isotropic mesh T of the unit cube [0, l] d , such that the diameters diam(T), T G T, 
are specified by a given Lipschitz function. 
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Lemma 5.5.4. Let s : [0, l] d — > (0, \/d] be a c-lipschitz function, where < c < 1. Let 
(T n ) n >o be the sequence of meshes produced by the algorithm when the collection Ai n C T n 
of marked simplices is defined as follows : 

M n := |r G T n ; nuns < diam(T)| . (5.103) 

Then the sequence of meshes stabilizes : there exists N > such that T n = Tn for all 
n > N . Furthermore we have for all T G Tn and all z G T 

diam(T) < s(z) < # c diam(T), (5.104) 

where 

^ 2 , 0+(1 + c)+c0 (5ios) 

in which D + := max{diam(T) ; T G To} and D_ := min{diam(T) ; T G To}. 

Proof: We first show that for any n > any T* G T n , and any z G T we have s(z) < 
K c diam(T„,). Indeed if T G 7o then 

s(z) < Vd = diam^T) < K c diam(T„,) 

since K c > 1 and since s is uniformly bounded by \/d. 

Otherwise T* has a father T' which was bisected in the refinement process, hence there 
exists a simplex T such that (5.102) holds and which was selected for bisection at some 
stage of the algorithm. 

We thus obtain for any z£T, since g(T) — 1 = g(T') > g(T) and min^s < diam(T) 

s(z) < nun s + c(diam(T') + d(T, T')) 

< diam(T) + c(diam(T') + d(T, T')) 

< D + 2-^' d + c{D + 2-^' d + D2~^ T 'y d ) 

< 2-^ T ^/ d (D + (l + c) + cD) 
= K c D.2- 9 ^ d 

< K c diam(T), 



which concludes the proof of the right part of (5.104). 

From the fourth line of this inequality we also obtain an uniform lower bound on the 
volume of the simplices generated by the refinement procedure 

2~9( T *) 1 / min{s(z) ; z G [0, l] d } 

-Z* — ^ Co '. — — I — ^ 

d\ ~ d\y 2i{D + {l + c)+cD) 

It follows that H^iTn) is uniformly bounded by 1/co, which immediately implies that the 
sequence {T n ) n >o of meshes stabilizes. 

Let N be such that T n = Tn for all n > N. We thus have M. N = and therefore 
diam(T) < minj- s for all T G Tn, which implies the left part of (5.104) and concludes the 
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proof of this lemma. 



o 



The rest of this section is devoted to the proof the point i) of Proposition 5.5.1 We thus 



consider a metric H G Hj, and we recall that there exists a Lipschitz function s : 
such that for all z G W 1 



— > 



H(z) 



Id 



slz 



For each n > we define a function s n : [0, l] d — > El, which is clearly Lipschitz, as follows 

s n (z) :=2- n s(2 n (z-z )) (5.106) 
, 1/2) is the barycenter of the cube [0, l] d . For all z G [0, l] d and all 

s n (z) < s n (z ) + \z- z \ < 2~ n s(0) + — . 



where z = (1/2 
n > we have 



The function s n is therefore uniformly bounded by y/d on [0, l] d when n is sufficiently 
large. We denote by T n the mesh of the cube [0, l] d described by Lemma 5.5.4 for the 
function s„. We thus have 



diam(T) < s n (z) < Cdiam(T) for all TeT n ,zeT. 



(5.107) 



where C = C(d) is the constant from Lemma 5.5.4 We denote by T n the mesh of the 
cube [— 2 n_1 , 2 n ~ 1 ] d obtained by translating the mesh T n by — zq and dilating it by 2 n . In 
mathematical terms : 

T„:= {2 n (T-z ); TeT n }. 
In view of (5.106) and ( |5.107 ) we thus have 

diam(T) < s(z) < K x diam(T) for all T e %, z G T, (5.108) 

We denote by Cq the smallest constant such that for all T in the finite set Tq one has 



CTld 



< U T < 



C 2 Id 



diam(T) 2 ^ '"^ J^diam(T) 2 ' 
and we thus obtain for all T G 7^ and all z G T 

C 2 H(z) <H T < ClH{z). 



(5.109) 



Unfortunately the mesh 7^ does not cover R d but only the cube [-2 n ~\ 2 n ~ 1 } d . The next 
lemma shows that a global mesh T of the infinite domain K. d can be extracted from the 
sequence of meshes (7^)„>o- 

Lemma 5.5.5. Let (fl n )n>o be an increasing sequence of polygonal domains ofM> d , which 
exhausts Mr. In other words 



and Q n C fi n +i for all n > 0. 



n>0 
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Let H G Hg, let (T n ) n >o be a sequence of conforming simplicial meshes of the domains 
Q n . Assume that for all n > 0, all T G T n and all z G T one has 



Co 2 H(z) <H T < ClH(z) 



(5.110) 



Then there exists a mesh T G T such that C 2 H(z) < Ht < CqH(z) for all T G T and 
all z G T. 

Proof: The heuristic idea of this proof is to "extract a converging subsequence" from the 
sequence (7~ n ) n >o of meshes. Unfortunately meshes are discrete objects of combinatorial 
nature, and the convergence of meshes has no meaning a priori. We therefore use Lipschitz 
functions as an intermediate object, because their convergence is well defined and they 
benefit from a compactness property. 

We associate a function 0^ : IR d — )• [0, 1] to each simplex T, which is defined as follows. 
The function 0t is supported on T, and for all z G T we have in terms of the barycentric 
coordinates X v (z) of z with respect to the vertices v G V of T : 



Mz 



min X v (z) 



According to Lemma 5.4.2 , we have for any z,z' G T and any v G V 

\X v (z) - X v (z')\ < \\z - z'\\ Ht , 



hence 



*) T {z) - 4>t{z')\ < max \X v (z) - X v (z')\ < \\z - z'\\ Hl 



(5.111) 



For all n > we define the function 



i d -»■ [0, 1] as follows 

= 



TeT„ 



It follows from (5.110) and (5.111), and (5.35), that for all T G T n and all z G int(T) 

dil z (0 n ; d H ) < C . 



Since in addition 



on 1 \ 0„, Corollary 5.2.6 implies that 



,d H ) ->■ 



is Co-Lipschitz. It follows from Ascoli's compactness theorem (for instance the version 
recalled in Theorem 6.2.3 in the next section) that there exists subsequence (0 n Jfc>o 
which converges uniformly on all compact sets of lR d to a Co-Lipschitz function 0. 

We therefore assume, up to such an extraction, that <j) n — > <p uniformly on all compact 
sets of M. d . We denote by T the collection of closures of connected components of the set 
{z G W l ; (j)(z) > 0} : 



T := {E ; E is a connected component of {z G 



>0}} 



For each zetf and each n > 0, we denote by T n (z) an element of T n containing z if it 
otherwise. Likewise we denote by T(z) an element of T containing 



exists, and T n {z 
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z if it exists, and otherwise. If <\>{z) > 0, then z G mt(T n (z)) for all n sufficiently large, 
and one easily checks that 

T n (z) converges in Hausdorff distance to T(z) G T . (5.112) 

Hence T is a collection of simplices, and the convergence T n (z) — > T(z) in Haussdorf 
distance, combined with the hypothesis Cq 2 H(z) < T n (z) < CqH(z), implies that 

C 2 H(z) < T(z) < C$H(z). (5.113) 

In order to conclude this proof we need to show that the collection T of simplices is a 



conforming mesh of M. d . We thus have to check the hypotheses of Definition 5.1.4 The 
interiors int(T) of the simplices T G T are pairwise disjoint since they are the connected 
components of the set {z G M. d ; <p(z) > 0}. We claim furthermore that T covers M. d . 
Indeed let z G lR d be arbitrary and let n > be such that z G f2 n , hence T n (z) ^ 0. Since 
Cq 2 H(z) < %T n {z) < CqH(z) there exists a subsequence {T n , k) (z))k>o which converges 
in the Haussdorf distance to a simplex T containing z. We clearly have T G T which 
establishes as announced that T covers M d . The fact that T is locally finite easily follows 



from (5.113). 



We now turn to the proof of the conformity property, and for that purpose we consider 
an arbitrary Lipschitz function / : (IR d ,<i#) — > R with compact support. Let T be a 
simplex such that Cq 2 H[z) < %t — CqH(z). We have for any vertex v of T 

I/O) - fM\ < d H (z T} v) < C \\z T - v\\ Ht = Co- 
Denoting as before by V the collection of vertices of T, and by (X v ) v5 v t ne barycentric 
coordinates on T, we thus obtain for any z,z' G T 

l T f(z) - I T f(z') = ]T(A^) - \ v (z'))(f(v) - f(z T )), 

where It denotes the Pi (piecewise affine) interpolation on a simplex T. Hence using 
Lemma 15.4.21 

|It/(*)-It/COI < c J2\K(z) - X v (z')\ 



AI2 



< C (d+l)^2\X v (z)-X v (z- 
= C Vd\\z-z'" 



\H T - 

The last inequality implies that 

dil 2 (I Tn /; d H ) < C 2 Vd 

for all T G T n and all z G int(T), where I7-/ denotes the Pi interpolation on a mesh 
T' G T. Let tiq > be s uch th at supp(/) C ft no , which implies that lj- n f is continuous. 



It follows from Corollary 5.2.6|that l Tn f ■ (K d , d H ) -> R is C5 2 Vc/-Lipschitz. 
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We define D := Urer^iT), and we (abusively) denote by I7- / : D — > K the function 
which coincides with 1^ / on the interior of each T G T. For any z G £) we obtain using 



(J5_112j) that I Tn /(z) -»■ I7- f(z), hence for any z,z' E D 

\l T f(z)-l T f(z / )\= lim |I Tn /(z)-I Tn /(z')|< VdC 2 Mz,z'). 

n— >oo 

The piecewise interpolant I7-/ : D — > R therefore extends to a v^C^-Lipschitz function 
I7- / : g?#) — >■ E. It easily follows that I7- / is continuous for any compactly suppor- 
ted C 1 function /. This property characterizes conforming meshes, hence T G T which 
concludes the proof. 



Corollary 5.5.6. (Bidimensional triangulations) Assume that the dimension is d = 2. 
There exists c > such that the following holds : if s : IR 2 — > M.*, is c-Lipschitz, then there 
exist a mesh T of IR 2 built of half squares, and such that for allT eT and z G T 



diam(T) < s(z) < 3diam(T)/2. 



(5.114) 



Proof: When the dimension is d — 2, the collection To of triangles contains on ly half 
squares, of area 1/2, hence D + = D_ = y/2. As c — > the constant K c defined by (5.105) 
tends to a/2. Hence K c < 3/2 when c is sufficiently small. 

From this point we construct as before a sequence of meshes (T n )n>n of the squares 
[— 2 n ~ 1 , 2" _1 ] which satisfies (5.107) hence (5.114), and we extract a global mesh T of IR 2 



using Lemma 5.5.5 The triangles T G T still satisfy the inequality (5.114) since they are 
limits in the Haussdorff distance of triangles from the triangulations T n - o 



5.5.2 Graded mesh generation 



We prove in this section a result of bi-dimensional mesh generation, Theorem 5.5.8 



which implies Point (iii) of Proposition 5.5.1 The key ingredients of this section come 



from the paper (66). We assume throughout this section that the dimension is d = 2. 

We first introduce the notion of equispaced points in an abstract metric space. Let 
(X, dx) be a metric space and let 5 > 0, we say that a subset V C X is 5-equispaced if 
the two following properties hold. 

a) (Covering) The distance from an arbitrary point x G X to V is bounded by 1 : for all 
x G X 

d x {x,V) := mf{d x (x,v) ; v G V} < 1. (5.115) 

b) (Separation) The pairwise distances between the points of V are larger than 5 : for all 
v, v' G V such that v 7^ v' 

d x {v,v')>5. (5.116) 



The next lemma establishes the existence of such a set under certain assumptions. 
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Lemma 5.5.7. Let (X,dx) be a metric space in which the closed balls B'(x,r) := {y G 
X ; d x {x,y) < r} are compact for all x G X and all r > 0. Let Vo C X be such that 
dx(v,v') > 1 for all v,v' G Vo such that v ^ v' . Then there exists a 1-equispaced set V of 
the metric space (X,dx), containing Vq. 

Proof: We may assume that Vo 7^ 0, up to including an arbitrary point of X. We choose 
an arbitrary point xq G Vo, and we define inductively a sequence (x n )o<n<iV of points and 
(V n )o<n<iv of subsets of X, where N G IN U {00} is specified later, proceeding as follows. 
Assuming that V n is already defined we consider the closed set 

X n := {x G X ; d x {x, V n ) > 1}. (5.117) 

If the set X n is empty, then the sequence ends and we define N :— n + 1. Otherwise we 
choose x n+ \ arbitrarily among the minimisers of dx{x,Xo) on X n : 

x n+ i G aigmmdx(x,x ), (5.118) 

x&X n 

and we define V„+i := V„ U {x n +i}- Such a minimizer exists since X n is closed and since 
the closed balls in X are compact. We define N := 00 if none of the sets (X n ) n > is empty. 
We define V := U < n< x>V n . By construction, we thus have dx(v, v') > 1 for all v , v' G V 



which establishes the separation property (5.116) 



We now assume for contradiction that the covering property (5.115) does not hold, 



which implies that N = 00. We choose a point x* E X such that g?x(^*,V) > 1, and 



we remark that x* G X n for all n > 0. The definition (5.118) of x n +i implies that 
dx(xo,x n ) < dx(x ,x*) for all n > 0, hence the collection of points {x n ; n > 0} is 
included in the closed ball {x G X ; ^(^O)^) < ^x^cb^*)} which is compact. The 
sequence (x n ) n > therefore admits a converging subsequence, but this contradict the fact 
that the pairwise distances between these points are larger than 1, which concludes the 
proof of this proposition. o 



We consider a fixed metric H G H. For any x, y G 1R 2 we define 

dx(y) ■= \\x - v\\h(x)- 

We also consider a fixed discrete collection of vertices, or "sites", Vci 2 , and we define 
the (anisotropic) Voronoi cell of a site v G V as follows 

Vor(v) := {zER 2 ; d v (z) < d w (z) for all w G V}. (5.119) 

The (anisotropic) Voronoi cell Vor(v) is thus the collection of points z which are closer to 
the vertex v than to any other site. The Voronoi diagram is the collection {Vor(t> ) ; v G 
V} of all Voronoi cells. The classical isotropic Voronoi is obtained by choosing H = Id 
identically on M. d . An instance of an (anisotropic) Voronoi diagram is illustrated on Figure 
EH 

For each z G M 2 we denote by V 2 C V the collection of vertices which contain z in 
their Voronoi region : 

V z := {v G V ; z G Vor(w)}. 
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Figure 5.7 - An (anisotropic) Voronoi Diagram (left), 
J.R.Shewchuck |66|) 



a wedge (right), (credit 



The geometric dual of the Voronoi diagram is the graph Q(V, H) defined as follows : 

Q(V, H) := {[v, v'] ; there exists z eM 2 such that V z = {v, v'}}. (5.120) 

The next theorem establishes that Q(V,H) is, under certain conditions on the metric 
H and the set V, the graph of a triangulation adapted to H. 

Theorem 5.5.8. For each 5 > there exists c = c(5) > and C = C(5) > 1 such that the 
following holds. Let H be a metric such that c 2 H G H g , and let V be a 5-equispaced subset 
of (IR 2 , dn)- Then the Q(V, H) is a planar graph which defines a partition o/IR 2 into strictly 
convex polygons. Arbitrarily triangulating each polygon yields a triangulation T G T g ,c 
which is C -adapted to H , and called an anisotropic Delaunay triangulation. Furthermore 
ifV is in general position then Q(V, H) is already the graph of a triangulation. 

Denote by c and C the constants attached to S = 1 in this theorem. It follows that for 
any metric H such that c 2 H G H s , there exists a mesh T G T^ g ,c which is C-adapted to H, 
obtained as the Delaunay triangulation of a 1-equispaced set of points V C (1R 2 , dn)- Such 

since the balls of the metric space (M 2 , dn) are com- 
This immediately implies Point iii) of Proposition 



a set exists according to Lemma 5.5.7 
pact according to Proposition 5.2.10 



5.5.1 according to Remark |5.5.2 



The rest of this section is devoted to the proof of Theorem 5.5.8 which is based on 
the methods and results presented in [66]. For that purpose we recall some definitions of 
this paper. 

Definition 5.5.9. - (Wedge of two points) We define the wedge wedge(t>,u>) of two 
distinct sites v,w as follows 

wedge(f , w) := {z G 1R 2 ; (z — v) T H(v)(w — v ) > and (z — w) T H(w)(v — w) > 0} 

- (Wedge property) We say that the Voronoi diagram of V is wedged if for any two 
distinct v,w G V we have 



Vor(v) n Vor(w) C wedge(u, w). 
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The wedge property expresses that the collection Vor(-u) n Vor(w) of points shared by 
the Voronoi regions of the sites v and w lie in a cone defined by two linear inequalities. 



The wedge of two points is illustrated on Figure 5.7 (right). Note that the wedge property 
is not satisfied on this figure. 

For any metric H and any constant c > such that c 2 H £ H„, it easily follows from 



Proposition 5.2.10 that for all x, y £ M 2 

ln(l + cd x (y)) < cd H (x,y) < -ln(l - cd x (y)), 
which is equivalent to 

1 -exp(-cd H (x,y)) < cd x (y) < exp(cd H (x,y)) - 1. 



(5.121) 



(5.122) 



Lemma 5.5.10. The following holds for any 5 > and any c > 0. Let H be a metric 
such that c 2 H £ H g and let V be a 5-equispaced subset of (R 2 ,d H ). Then for any v £ V 
and any q £ Vor(i>) one has 



dv{q) ■= \\q -v\\ H (v) < r (c) := (e c - l)/c. 
// v,w £ V satisfy Vor(t>) fl Vor(w) ^ 0, then 

d H {v,w) < n(c) := -21n(2 - e c )/c. 
A^ote that r (c) — > 1 and ri(c) — > 2 as c — >• 0. Note also that du{v, w) > 5. 
Proof: For any q £ V there exists t> * £ V such that dn{q,v^) < 1, and therefore 

\\q - v*\\h(v*) < r {c) := (e c - l)/c, 



(5.123) 



(5.124) 



which implies (5.123) using the definition (5.119) of the Voronoi diagram. We thus turn 



to the proof of (5.124) and for that purpose we consider a point q £ Vor(i>) fl Vor(w). We 



thus have \\q — v\\m v ) = \\q — w\\h( w ) < r o( c ) hence using (5.121) 



d H (v,w) < d H (v,q) + d H (q,w) < ri(c) := -2fn(l - cr (c))/c, 
which concludes the proof. 



o 



The next proposition, which is partly is based on Lemma 5 from 66 , shows that the 
wedge property is automatically satisfied in our context. 

Proposition 5.5.11. For each 5 there exists c = c(5) > such that the following holds. 
If a metric H satisfies c 2 H £ H 9 and if V is a 5-equispaced subset of (R 2 ,dH), then the 
wedge property is satisfied. 

Proof: We assume for contradiction that there exists two distinct vertices v, w £ V , and a 
point q £ Vor(V) nVoi(W) such that q £ wedge(t>, w). We may assume, up to exchanging 
the roles of v and w, that 

(q-w) T H(w)(v- w) < 0. 
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Defining the "relative distortion" r := expd x {H(v),H(w)) we obtain using Lemma 5 



from 66 



d w {v) < d w (q)V 



1. 



(5.125) 

We now estimate the quantities d w (v), d w (q) and r appearing in this estimate, in order 
to obtain a contradiction when c is sufficiently small. 
We have 

lnr = d x (H(v), H{w)) < cdn(v,w) < eri(c). 
Since V is 5-equispaced we have dn(v,w) > 6 and therefore 



cd w (v) > 1 — exp(— cdn(v , w)) > 1 — exp(— c6). 
Injecting gT2§ , ( |5l26| and jQ27) in ( [5125] ) we obtain 

1 - eXp( -^ ) < eXp(c) ~ 1 v / exp(2cr 1 (c))-l . 



(5.126) 
(5.127) 



(5.128) 



The left hand side tends to 5 as c — > 0, while the right hand side tends to (and only 
depends on c). We thus obtain a contradiction when c is sufficiently small which concludes 
the proof of this proposition. o 



Theorem 5.5.12 (Dual triangulation theorem. Labelle, Shewchuck, [66]). Let H G H 

and let V C M 2 be a discrete point set. If the Voronoi diagram of V is wedged, then the 
geometric dual of this diagram is a polygonalization of M 2 with strictly convex polygons, 
and is a triangulation if V is in general position. 

Under the hypotheses of this theorem the vertices v i, • • • , of a polygon in the dual 
Voronoi diagram satisfy by construction Vot(vi) PI • • • H Vor(wfc) ^ 0. Let T be a trian- 
gulation obtained by arbitrarily triangulating these polygons. The vertices Vi,v 2 , v 3 of a 
triangle TgT satisfy Vor(t>i) fl Vor(w 2 ) D Vor(v 3 ) ^ 0. 

The next theorem, originally stated in [66] as Corollary 10 to Theorem 9 of the same 
paper, allows to control the angles of a triangle in the anisotropic Delaunay triangulation. 



Theorem 5.5.13 (Labelle, Shewchuk, [66]). Let Vi } v 2 ,v 3 e V and let q G Vor^x) fl 
Vor(t> 2 ) fl Vor(t> 3 ). Let 

r = d Vl (q) = d V2 (q) = d Vz (q) and I = mm{d Vi (vj) ; i,j G {1,2, 3}, 2 ^ j}. 

Define (3 := max{r//, l/\/2}, 

7 := expmax{d x (H(vi), H(vj))l < i < j < 3}, 

and 

X ■= ^ - V-^. (5.129) 



Ifl< v^2 and x > then any angle 9 of the triangle of vertices ( v / ^ff(^Jt; fc ) 1 < fe <3 satisfies 

arcsin(x/7 2 ) < 9 < 2 arccos(x/7 2 )- 
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Corollary 5.5.14. For each 5, < 8 < 1 there exists c = c(<5) > and C = C(S) > 1 
such that the following holds. Let H be a metric such that c 2 H G H g and let V be a 
S-equispaced subset of (R 2 ,dn)- Ifvi,v 2 ,v 3 G V and if Vor(ui) fl Vor(t> 2 ) fl Vor(t> 3 ) ^ 0, 
i/ien denoting by T the triangle of vertices v i, v 2 , t>3 we /iove for all z G T 

C- 2 H(z) <H T < C 2 H(z). 

Proof: We first obtain obtain some explicit bounds on the quantities r, I, (3, 7 and \ 
appearing in Theorem 5.5. 13[ It follows from Lemma 5.5. 10| that 



Therefore 



and 



f < r o( c ) an d 8 < dn(vi,Vj) < ri(c) 



Z := minc?^^) > r 2 (c) := (1 - exp(— c5))/c, 
L := max. d Vi (vj) < (exp(cri(c)) — l)/c, 



7 < maxexp(c(i//(vj, f-,)) < exp(cri(c)). 



It follows that f3 < r 3 (c) := min{r (c)/r 2 (c), l/y/2}. Injecting this into (5.129) yields 

1 (exp(2cr 1 (c)) - l)r 3 (c) 



X > r 4 (c) := 



2r 3 (c) 



and therefore x/7 2 > r s( c ) := r 4 (c) exp(— 2cri(c)). 

For any fixed 5, < 5 < 1 we obtain as c — > the limits r (c) — >• 1, ri(c) — >■ 2, 
^(c) — >■ 5, T3(c) — > mm{5, 1/a/2}, r 4 (c) — >■ l/(2min{5, l/\/2}) and r 5 (c) has the same 
limit. 

We may therefore choose c sufficiently small in such way that 

I > k := 5/2, L < L := 4 and x/7 2 > Xo := 1/(45). 

We denote by T the triangle of vertices t>i,t>2,t>3. The length of any edge of T" : = 
yjH(v\)(T) is bounded above by L and below by l respectively, and the angles of T' 
satisfy arcsin(Xo) < & < 2arccos(Xo)- The collection of triangles centered at the origin 
which satisfy these inequalities is compact, thus there exists a constant C\ = C\(5) such 
that 

C7 2 Id < H T > < Cl Id, 

hence C^ 2 H( Vl ) < U T < < C\E{v x ). For any z G T we obtain, remarking that \\z— t>i||-H T < 
2 for all z G T and using (5.53), 

Cf 2 (1 - 2cC 1 ) 2 H(z) < H T > < C?(l - 2cC l )- 2 H(z). 

We may assume that c < l/(4Ci), which concludes the proof of this proposition with 

C = 2C V o 
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We now prove of Theorem |5.5.8 According to the above results for any 5 > there 
exists c = c(S) and C = C(S) such that the following holds. For any metric H such 
that c 2 H E Hg, and any set V C (IR 2 , du) which is 5-equispaced, the Voronoi diagram 



associated to V and H is wedged according to Proposition |5 . 5 . 1 1) The dual of this diagram 
is therefore a polygonalization of IR 2 with strictly convex polygons according to Theorem 



5.5.12| Randomly triangulating these cells yields a triangulation T E T which is C- 

and a point 



adapted to H according to Corollary 5.5.14 



Furthermore consider to simplices T,T' E T such that T D T" ^ 
zeTDT'. Then 

C- 2 H T < H(z) < C 2 H T >, 



which shows that T E T g c 2 ) an d concludes the proof of Theorem 5.5.8 



5.5.3 Quasi-Acute mesh generation 



This section is devoted to the proof of Point ii) of Proposition 5.5.1 , which is obtained 



by combining the constructions of isotropic and graded triangulations presented in £5.5.1 



and §5.5.2| to produce a quasi-acute triangulation. The key idea of this proof if that the 
anisotropic Delaunay triangulation described in the previous subsection contains only 
few strongly obtuse angles if the set V of vertices is structured along lines which are 
transverse to the direction of anisotropy, as illustrated on Figure 5.2 (in the introduction 
of this chapter). 

We consider a fixed metric H such that c 2 H E H a , where c is an absolute constant 
specified in the proof and such that < c < 1. This subsection is devoted to the construc- 
tion of a mesh T E T a ,c which is C-adapted to H, where C is an absolute constant also 



specified in the proof. This construction immediately implies Proposition 5.5.1 according 
to Remark 15. 5. 2[ 

For each z E IR 2 we define 



X(z) 



\H(z) 



and fi(z) = \\H(z) 



and we observe that X(z) > fi(z). The functions A and fi are c-Lipschitz according to 
Corollary |5.2.8| If c is sufficiently small, then there exists according to Corollary |5.5.6| a 



mesh 7o such that for all T E To and all z E T 



-\{z) < diam(T) < X(z). 



(5.130) 



The mesh % is built of half squares by construction (as illustrated on Figure 5.6). The 
diameter of any triangle in To is a power of v^2, hence for any two triangles T,T' E % 



sharing a vertex v we obtain using (5.130) 



diam(T)/\/2 < diam(T') < v^diam(T). 



(5.131) 



For any edge [v,w] of any triangle T E To we have diam(T)/\/2 < \v — w\ < diam(T), 
hence for any two edges [v,w] and [u, itf'] (sharing the vertex v) of the triangulation 7o 



(5.132) 
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Inequality (5.130) also implies that for any edge [v,w] of To 

X{z^2 < diam(T) < {y _ w{ < < ^ 

3 \/2 

We denote by Vo the collection of vertices of %■ Consider a fixed v G Vo and any vertex 
m G Vo such that \v — w\ is minimal. Then [v,w] is clearly an edge of To, since this 
triangulation is conforming and built of half squares. This implies for any distinct v , w G Vo 

\/2 

X(v)^—<\v-w\. (5.134) 
o 

For each z G IR 2 we define 



p{z) := X{z)/p{z) = y/\\H(z)\\\\H{z)-^\\ G [1, oo). 

Consider z, z' G M 2 such that \z — z'\ < r\(z), where r > is a constant. We obtain since 
A and /i are c-Lipschitz 

p(z') - W > X ^ l ~ rc ) > l ~ rc (5 135) 

For each po > 1, r > we denote by c p (po,r ) > the constant such that p = 

l-ro C p(po,r ) Rence f aU / R 2 

(po+1) i +r - OCp(po,Co) 

1 2 — z'\ < r X(z) 
p{z) > Po + 1 implies p(z') > p 
c < c p (po,r ) 

For each z6K 2 such that p(z) > 1 we denote by 9{z) 6 § := {w 6 I 2 ; |w| = 1} a 
unit vector such that 

H(z) = X(zy 2 6(z)6(zf + Mz)- 2 (Id -6(z)6(z) T ). (5.136) 

Note that there exists two, opposite, choices for the vector 9(z). For each u, u'6K 2 \ {0} 
we define the (unoriented) angle 



<(w, u') := arccos 

and 



<(«,«') := mm{<(w, w), <(w, -«')} = arccos ( , ) G [0,7T/2]. 

Lemma 5.5.15. If c is sufficiently small, then the following holds. For any z,z' G K 2 

{ V - A <~ioa(*) implies < {e{z) > e{z>)) - n/12 - 

Likewise, p(z) > 4 and \z - z'\ < X(z) implies <{6{z),6{z')) < l/(10\/2). 



260 



Chapter 5. Are riemannian metrics equivalent to simplicial meshes ? 




Figure 5.8 - Neighborhood Vj- (v) of a simplex v in the triangulation 7o (left), the 
neighbors w + and w_ of a vertex v in % (right). 



Proof: We consider a fixed point z such that p(z) > 2 and we denote B := {z' G 
M 2 ; \z' — z\ < 10A(z)}. For each z' G B we have 

X(z') — p(z') > (X(z) — c\z — z'\) — (}i(z) + c\z — z'\) 

= X(z) — fj,(z) — 2c\z — z'\ 

> X(z) - X(z)/2 - 20cA(^) 

= A(z)(l/2-20c), 

hence X(z') — fi(z') > X(z)/3 if c < 1/120, which we assume from this point. 

Furthermore since B is simply connected there exists a continuous function 9* : B — > § 
such that 9^{z') = ±8(z ; ) for all z' G B. It follows from Theorem 5.3.1 that for all z' G B 

X{z) diW(0*) < 3(A(;0 - dily(^) < 3c. 



Therefore, since the distance d§ on S involved in Theorem 5.3.1 coincides with the angle 
< we obtain 

<(9(z),6(z')) < <(e,(z),0.(z')) < \z-z'\ max dil 5 (0*) < 30c 

which concludes the proof if 30c < 7T/12. The second estimate is obtained similarly. o 
For each vertex v G Vo we define a set Y v C 1R 2 as follows : Y v := {v} if p{y) < 3, and 



r„ := 



v 2w- 



U 



v 2w + 



(5.137) 



if p(v) > 3, where [v, wJ\ and [v ,w+] are two edges of 7o which respectively minimize and 
maximize the quantity (9(v) , w — v) (among the edges containing v). In other words for 
any edge of 7o of the form [v,w] we have 



{9(v) L ,w. -v)< (9(v) ± , w-v) < (9{v) x ,w + - v). 



(5.138) 



This property is illustrated on Figure 5.8 (right). 
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Lemma 5.5.16. let v G Vo be such that p(v) > 3, and let [v,w+], [v,w-\ be two edges of 

(5.139) 



% satisfying (5.138). We have 

<(9(v) ± ,w + - v) < 7?/4, and <(9(v ) ± , v - wJ) < 7T/4. 



Proof: We restrict without loss of generality our attention to the proof of the first in- 
equality, and we assume for contradiction that p> := <(9( y v) ± ,w + — v)> 7T/4. Among the 
two vertices w G Vo such that the triangle of vertices v, w, w + belongs to %, we choose the 
one such that <(6 ,± , w — v) is minimal. We denote if) := <(u> — v, w + — v) G {tt/4, 7T/2} 
and we obtain 



(9(v) ± , w + — v) = \w + — v | cosy? and (^(w)" 1 , w — v ) 



v| COs((y9 — if)). 



If if) = 7T/2 then \w + 



\w 



v\ since the triangles T 6 To are half squares, and 



cos((f — ip) = siny?. If follows from (5.138) that cos<y? > simp which contradicts our as- 
sumption that y?>7T/4. If ^ = 7T/4 then \w + — v\ = \/2\w — v\ or \w + — v\ = \w — v\/y2. 
We thus have \/2cos<£> > cos(y? — 7T/4) which again contradicts our assumption that 
p> > 7T/4. o 



We define T := U v£ \; T v , and for each z G T \ V we define 

r 2 := |J rv 



(5.140) 



Such a set T z is illustrated on Figure 5.9 (right). For each z G T there exists v G Vo such 
that T z = T v , or there exists v, v' G Vo, and w, w' G Vo, such that z G [v, v'] and 



r„ u n, 



v + 2w 




' . v' + 2w'~ 




U [u, «'] U 




[ 3 




r 3 J 



Note that for any vertex v G Vo one has r„ fl Vo = {v}, hence (5.140) agrees with (5.137) 
for any z G Vo- Note also that for any z, z' G V the following are equivalent 

- z' g r„ 

- There exists v G Vo such that {z, z'} C r„, 

- z g r zl . 

The next proposition gives a robust estimate of the orientation of the set T. 

Lemma 5.5.17. The following holds if c is sufficiently small. Let z aY, let p,q G T z be 

two distinct points, and let z' be such that \z — z'\ < b\(z). Then 

<(9(z') x ,p-q) < tt/3. 

Proof: We first remark that T z is not a singleton since the points p and q are distinct. 
Therefore T z = T v UT v r, or T z = T v , for some v,v' G Vo satisfying p{y) > 3 and p{v') > 3. 
We focus on the case of the union T v U T v i . 
Since \v — z\ < min{A(z), X(v)} we obtain 

\v - z'\ < \v - z\ + \z - z'\ < X(v) + 5X(z) < X(v) + 5(X(v) + c\v - z\) < X(v)(Q + 5c). 
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R 2 Wv) 





-x *- 



Figure 5.9 - Illustration of the proof of Proposition 5.5.18 



If c is sufficiently small, we thus have \v — z'\ < 10\(v ) and therefore <(8(v),6(z')) < 7T/12 



according to Lemma 5.5.15 Any of the two tangents t to the polygonal line T v thus satisfies 



<(0(2') , *) < <(9(vr,t) + <(9(v), 6{z)) < 7T/4 + 7T/12 = 7T/3. 

Proceeding likewise we obtain that <(8(z') ± , t) < 7T/3 for any of the (three) tangents to 
the continuous polygonal line T z = T v U F v i, and therefore <(9(z') ± ,p — q) < 7T/3 for any 
p, q on this line which concludes the proof. o 

The next proposition gives some upper and lower bounds on the distance from a point 
to the set Y. 



Proposition 5.5.18. If c is sufficiently small, then the following holds. For any z G T, 



min{|z - e| ; e eT\T z }> —X(z) 

y i 



(5.141) 



and for any z G 



min{||2; - e\\ H (z) ] e G T} < 2 



(5.142) 



Proof: We first establish (5.141). Let z G T and let v G Vq be such that z G r„. There 



exists an edge [v,u>*] of % and a parameter a G [0,2/3] such that z — (1 — a)v + aw*. 
We denote by V%(v) the union of all triangles T E % containing v, and by Vq(v) the 
collection of vertices w such that [v, w] is an edge of T. We thus have 



r\r z cW\v To (v) |J 

iueVo(v)\{M*} 



2w + w 



w 



The set appearing in the right hand side is illustrated on Figure 



5.9 



(left). Denoting 
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d(z, E) := inf{|z — e| ; e G E} we obtain 

d(z,T\T z ) > mm \d(z,R 2 \V To (v)), min d[z, 



weV (v)\{w*} 



2v + w 



w 



> min 



> 



mm 



V2 ' 3\/2 ™sv («)\W 



\v — w 



mm m — w\ 



3a/2 weVoW 

> 

- 3^ 3 

^ A(z) — c\z — v | A(^)(l — c) 

~ 9 _ 9^ ' 



where we used (5.133) in the fourth line, and the fact that A is c-Lipschitz in the last line. 



This establishes (5.141) if c is sufficiently small. 



We now turn to the proof of (5.142). Let T G % and let z G T. Since T is a half square 



there exists a vertex v of T such that \z — v\ < diam(T)/2, hence 



\z — v\ diam(T) X(z) 

\Z-V\\ H {Z) < - 1 < - . ■ < 



2fi(z) ~ 2fi(z) 



which concludes the proof if p(z) < 4. If p(z) > 4 and c < c p (3, 1) then p(y) > 3 for any 
vertex v of the triangle T. 

We choose two vertices v, w of T such that the line [z + r9(z)) re - R intersects dT on the 
segment [v, (v + w)/2\. We denote by u>_, w + G Vq the neighbors of v in % which satisfy 



(5.138). Our first step is to establish that the line (z + r6(z)) rE j& intersects the set r„, 
which is heuristically due to the fact that the angle <(8(z), 6{v )) is small and that the 
vertices W- and w + are the furthest away from v in the direction of 6{v) x as illustrated 



on Figure 5.8 (right). 



It follows from (5.132) an Lemma 5.5.16 that 



(9(v ) ± , w + — v ) > \w + — v\ cos(7T/4) > 



\w — v\ 1 



y/2 



(5.143) 



We assume without loss of generality that (6(z),8(v)) > which implies according to 
Lemma 15.5.151 



\6{z) - 6{v)\ < <{8{z), 9(v)) = <{6{z), 6{v)) < S := 1/(10^). 



It follows that 



(d(z) L ,w-v) = (O(z) 1 -0(v) ± ,w-v) + (6(v) ± ,w-v) 

< \9(z) -6(v)\\w -v\ + (6{v) ± ,w + -v) 

< (l + 2SV2)(9(v) ± ,w + -v), 
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where we injected (5.138) in the second line, and (5.143) in the third line. Denoting 



6 := <(9(v) ± ,w + - v) < 7T/4 and $ := <(9(z) ± ,w + - v) we obtain 

(9(z) ± ,w + -v) cos($) > cos(9)-|$-e| _ 1 |$-9| >1 5 ^ 



(9(v)- L ,w + -v) cos(0) 



cos(6) 



cos 6 



hence 



(9(z)\w -v)< 1 -±^ ( (z)\ w + -v) = \ (9(z)\w + - v). 



Proceeding likewise for w_ we obtain 

\{9{z)^^ -v) < (9(z) ± ,w-v) < ^(6{z) ± ,w+ -v), 

and therefore since z + r6(z) G [v, (v + w)/2] for some r e M, 

2 -(9{z)^,w„-v) < (9(z) ± ,z-v) < 2 -(9(z) ± 1 w + -v). 

It follows that the line (z + r9(z)) r£R intersects the set T v := [v, (v + 2w+)/3] U [v, (v + 
2w_)/3] at some point z' . The point z' belongs to a triangle T' containing v, hence 

X(z)\\z' - z\\ H{z) = X(z)\\r9(z)\\ H(z) 

= \z-A 

< \z — v\ + \v — z'\ 

2 

< diam(T) H — diam(T') 



< (l + ^V2j diam(T) < (l + A(; 



where we used (5.131) in the last line. This concludes the proof of this proposition. 



o 



We construct in the next lemma a collection V of sites by sampling the metric space 
(r, dff), which is the collection of vertices of our future quasi-acute triangulation T. This 
set is illustrated by small crosses on T z on Figure [B~9| (right). 



Lemma 5.5.19. The following holds if c is sufficiently small. There exists a discrete set 
V C T containing Vo and satisfying the following : 

i) (Separation) For allv,w G V, dn(v,w) > 1/10. 

ii) (Distance from F ) For all z G T, dn{z, V) < 1/10. 

Hi) (Distance from R 2 ) For all z G M 2 , d H (z, V) < 2 + 1/4. 



Proof: For any distinct v,w G Vo we obtain using (5.134) and Proposition 5.2.10 



d H (v,w) > ln(l + c\\v — w\\ H (v))/c 

> ln(l + c\v - w\/\(v))/c 

> ln(l + cV2/3)/c 
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which is larger than 1/10 if c is sufficiently small. 

We denote by V a 1-equispaced subset of the metric space (r, 10 dn) containing Vq, 



which exists according to Lemma 5.5.7 The two first announced properties, separation 
and distance from T, follow directly from this construction. 

The third property is obtained as follows : for any z £ IR 2 there exists according to 
Proposition 5.5.18 a point e £ T such that \\z — e\\mz) < 2. Furthermore there exists a 



vertex v £ V such that c?#(e, v) < 1/10, hence 

d H (z,V) < d H {z,e) + d H {e,v) < -ln(l - c\\z - e\\ H{z) )/c+ 1/10 < - ln(l - 2c)/c + 1/10. 
which is smaller than 2 + 1/4 if c is sufficiently small. 



o 



The next lemma introduces the anisotropic Delaunay triangulation T associated to 
the metric H and the set of points V. 

Lemma 5.5.20. There exists an absolute constant Cq > 1 such that the following holds 
if c is sufficiently small. The set V is wedged with respect to the metric H , and the 
anisotropic Delaunay triangulation T obtained by arbitrarily triangulating the cells of the 
graph Q(V,H) is Co-adapted to H and belongs to T g ,c - 

Proof: We denote a := (2 + 1/4) -1 throughout the proof of this lemma. It follows from to 
Lemma 5.5.19 that the set V is a a/10-equispaced subset of the metric space (M. 2 ,adn)- 



Theorem 5.5.8 applied to the metric a 2 H, therefore implies that, if c is sufficiently 
small, the set V is wedged for the metric a 2 H and that arbitrarily triangulating the 
convex cells of the graph Q(V,a 2 H), which are convex yields a triangulation T which is 
C-adapted to the metric ot 2 H (hence C := C/a-adapted to H), where C is an absolute 
constant. 

It immediately follows from the definition (5.120) that the graphs Q(V,a 2 H) and 



G(V, H) are equal, and Definition 5.5.9 shows that the wedge property holds for H if and 
only if it holds for a 2 H. 



o 

The key ingredient of the construction of the anisotropic Delaunay triangulation is the 
anisotropic Voronoi diagram introduced in (5.119). Our first lemma compares the Voronoi 
regions with some balls. We recall that Bjj(z,r) := \z' £ IR 2 ; \\z' — z\\ < r}, and B(z,r) 
denotes the usual euclidean ball of radius r centered at z. 

Lemma 5.5.21. If c is sufficiently small, then the following holds. For any v £ V 

1. B H (v, 1/25) C Vor(w) 

2. £#0,2 + 1/3) D Vor(w) 

3. B(v, i/i(v)/i9) d Voi(v) nr = Vor(w) n r,, 

Proof: We first establish Point 1., and for that purpose we consider z £ <9Vor(t>), hence 
z £ Vor(t>) n Vor(w) for some vertex w £ V. We define r := \\z — = \\z — w\\h( w ) 
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and we obtain using Point i) of Lemma 5.5.19 



1/10 < d H (v,w) 

< d H (z,v) + d H (z,w) 

< -ln(l -c||^-v||/r(„))/c-ln(l - c\\z - v\\ H{v) )/c 
= -2 ln(l - cr)/c. 

It follows that 

r > (1 - exp(-c/20))/c, 

and therefore r > 1/25 if c is sufficiently small, which concludes the proof of Point 1. 
We now turn to Point 2, and for that purpose we remark that for any z G M? there 



exists according to Point iii) of Lemma 5.5.19 a vertex w G V such that dniz, w) < 2 + 1/ '4. 
It follows that 



\z — w 



\h{w) < (exp(cd H (z,w)) - l)/c < (exp(c(2 + 1/4)) - l)/c. 



Hence \\z — w\\h( w ) <2 + l/3ifcis sufficiently small. It thus follows from the definition 
(5.119) of the Voronoi diagram that \\z — v\\h(v) < 2 + 1/3 for all z G Vor(t>), which 



concludes the proof of Point 2. 

We now turn to Point 3, and for that purpose we remark that for all z G T there exists 



according to Point ii) of Lemma 5.5.19 a vertex w G V such that dH(z,w) < 1/10. It 
follows that 



\z — w 



\h(w) < (exp(cd H (z,w)) - I)/ 'c < (exp(c/10) - l)/c. 



Hence \\z — w\\h(w) ^ 2/19 if c is sufficiently small. It thus follows from the definition of 
the Voronoi diagram that \\z — v\\h(v) < 2/19 for all z G Vor(f ) fl T. Therefore 



\z — v\ 



< \\z — v\ 



H(v) 



2 10 

< — < — 
~ 19 91 



which implies that z G T v according to Proposition 5.5.18 We thus have <\(9(v) , z — v)< 



7T/3, according to Lemma 5.5.17, and therefore 

\z — v ' 



cos(7T/3) < \\z - v\\ H (v) < 



19 



which concludes the proof of this proposition. 



We recall that, by definition of the anisotropic Delaunay triangulation, the Voronoi 
regions Vor(w) and Vor(u>) intersect for any edge [v,w] of T. We say that an edge [v,w] 
of T is transverse if Vor(w) fl Vor(w) fl V ^ 0, and we say that [v, w] is aligned otherwise. 

Our next intermediate result estimates the length of transverse and aligned edges. 

Lemma 5.5.22. Let [v,w] be an edge ofT and let z G [v, w]. 

1. If [v, w] is transverse, then \v — w\ < fi(z)/2. More precisely max{|t> — x\, \w — x\} < 
n{z)/& for all x G Vor(t>) D Vor(w) fl Y . 
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2. In any case \\v — w\\h( z ) < 5. 

Proof: We first establish Point 1., hence we assume that [v,w] is transverse and we 
consider a point x G Vor(f) fl Vor(w) R T. For notational simplicity we denote := 4/19. 



It follows from Lemma 5.5.21 that \v—x\ < (3jj,(v) < f3(/j(x)+c\v—x\) } hence \v— x\ < (3(1 — 
c/3) _1 /i(x). Likewise \w — x\ < (3(1 — c(3)^ 1 ji(x). Furthermore \z — x\ < max{\v — x\, \w — x\} 
since z G [v,w], hence 

n(x) < fi(z) + c\z — x\ < fj,(z) + c(3[l — c/3)^ 1 fx(x) 

which implies fi(x) < (1 — c/3(l — c/3) _1 )~ 1 yu(2;). Therefore 

\v -x\< 13(1 - cp)- l n(x) < /3(1 - cp)-\\ - c(3(l - 

The term in front of n(z) in the right hand side tends to (3 = 4/19 < 1/4 as c — > 0. 
If c is sufficiently small we therefore obtain as announced \v — x\ < fi(z)/A. Likewise 
\w — x\ < n(z)/A and therefore \v — w\ < ji(z)/2. 



We now turn to the proof of the second point. We have according to Lemma 5.5.10 
adff(v, w) = d a 2 H (v, w) < —2 ln(2 — e c )/c' 

where d := c/a. Hence 

d H (v,w) < r (c) := -21n(2 - e c/a )/c. 
If follows that \\v — w\\h(v) < r i(c) := (e cr °^ — l)/c, which implies using (5.53) that 



\\v - w\\ H{z) < \\v - w\\ H(v) (l - c\\v - z\\ H (v)) 1 < r 2(c) := n(c)(l - cri(c)) 1 

for any z G [v , w]. As c — >■ the functions r , ri and r 2 all tend to 2(1 + 1/4) < 5. Choosing 
c sufficiently small we conclude the proof of this proposition. o 

The next lemma describes the orientations of the transverse and aligned edges of T, 
in regions of sufficient anisotropy. We introduce the constant 

p := max{50/sin(7T/12), 11C }, 



where Cq is the constant from Lemma 5.5.20| (we only use in the sequel that T is Cq 
equivalent to the metric H). 

For any v, w G V, we say that [v, w] is an edge of T if 

(v,w)cT\V, 

where (v, w) denotes the relative interior of the segment [v, w]. The next lemma characte- 
rizes the transverse edges of T in regions where the anisotropy is sufficiently pronounced. 
For any x G V 

T+ := {v' G T„ ; (6(x) ± , v' - x) > 0}, 

and we define T~ similarly. If [v,w] is an edge of T, then one easily checks that w is the 
closest element to v in Tt H V or r~ H V. 
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Lemma 5.5.23. The following holds if c is sufficiently small. Letv,w £ V be a transverse 
edge, and assume that there exists z £ [v,w] such that p(z) > p . Then [v,w] is and edge 
ofT, Vor(f) D Vor(w) R V is a singleton {x}, and x £ [i^u?]. 

Proof: We first consider an transverse edge [t>, w], a point z £ [v,w] such that p(z) > po, 



and a point x £ Vor(t>) fl Vor(w) fl T. According to Lemma 5.5.21 we have x £ r„ fl T w , 
hence v, w £ T x . 

We may assume without loss of generality that v £ F+. We denote by v' the point 
of which is the closest to x, and our first objective is to to establish that v = v'. We 
assume for contradiction that this is not the case. 

It follows from Lemma 5.5.17 that the segment [x, v] and the line v' + 9(v')M intersect. 
Hence there exists a £ [0, 1] and r £ K such that 



x + a(v — x) — v' + r6{v') 
Let u be a unit vector orthogonal to v — x. We have (x,u) = (v',u) + r(9(v'),u), and 



therefore using Lemma 5.5.17 



\r\/2 < \r\ cos <(6(v') ± , v — x) 

= \r(e(v'),u)\ 

= \{x-v',u)\ 

< \x — v'\ < \x — v I < /i(z)/4 



where we used and Point 2. of Lemma [5.5.211 in the last. 
According to Lemma 3 (Visibility Lemma) in 



66 the Voronoi regions of a wedged 



Voronoi diagram are star-shaped. It thus follows from Lemma 5.5.21| that the segment 
[x,v] does not intersect B H {v' , 1/25), hence \r\ > A(V)/25 which yields 

A(V) - c\z - v'\ < \(v') < p(z)2h/2. 

It follows from Lemma 15.5.221 that 



\z — v'\ < \z — x\ 



\x 



v'\ < max{|t> — x\, \w — x\} + \x — v\ < p(z)/2. 



We thus obtain 25/2 + c/2 > p(z) > po which is a contradiction since c < 1. This 



contradiction is illustrated on Figure 5.10 (left). 

Hence v = v' is the point of the closest from x. If w £ T+, then we obtain w = v' = v 
which is a contradiction. Hence w £ T~ , and reasoning similarly we find that w is the 
point of T~ the closest to x. This implies that [v,w] is the unique edge of T containing x 
(note that the only points of V which belong to two edges of T are some vertices v £ V. 
But x (fi V since any vertex v £ V belongs to the interior of its own Voronoi region). 

Assume for contradiction that there exists another point x' £ Vor(f ) fl Vor(w) flT. The 
above argument shows that [v,w] is the edge of T containing x' . Therefore x,x' £ [u, w], 
but this implies x = x' since the Voronoi regions are star shaped. o 



Our next purpose is to estimate the orientations of the edges of T. 
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Figure 5.10 - Illustration of the proof of Lemmas 5.5.23 (left) and 5.5.25 (right) 



Lemma 5.5.24. The following holds if c is sufficiently small. Let [v,w] be an edge of T 
such that w (/iT v and assume that p(z) > po for some z G [v, w] . Then 

<{6(z),w-v) < 7T/12. 

Proof: It follows from Lemma [5.5.201 that 

\w — v\ 

sin <(0(z), w — v) < \\w — v\\h(z) < 5. 



On the other hand Proposition 5.5.18 yields since w ffc T v 

\v-w\> A(v)10/91 > (X(z) - c\z - w|)10/91, 

therefore \v — w\/X(z) > 10/(91 + 10c/91) which implies that \v — w\ > X(z)/10 if c is 
sufficiently small. It follows that 



sin <(0(z), w —v)< 



50 



which concludes the proof since po ^ 50/ sin(7T/12). 







The next lemma establishes that the aligned edges of T their name indicates, 

aligned with the direction 9 of anisotropy in regions where this anisotropy is sufficiently 
pronounced. 

Lemma 5.5.25. The following holds if c is sufficiently small. 

1. For any vertex v G V such that p{y) > po, there exists two aligned edges [v,w], 
[v,w f ] of T such that 



<(0(v),w-v) <7T/12 and <(#(f), v — w') < 7T/12 



(5.144) 



2. Consider an aligned edge [v,v'\ ofT and a point z G [v,v'\ such that p(z) > po + 1. 
Then <(6(z),w-v) < 7T/12. 
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Proof: We first establish Point 1., and as an intermediate objective we prove that T v \ 



Vor(t>) 7^ 0. It follows from the definition (5.140) of T v that there exists an edge [t>o,Wo] 
of 7o such that v G [v , (v + 2w )/3] C T v . Therefore according to (5.133) 



max{|t> — vq\, \v — (vq + 2u> )/3|} > \v — w \/3 > \(v)\/2/9. 



On the other hand \z — v\ < 4p(v)/19 for all z G Vor(w) D r„ according to Lemma 5.5.21 
omce p(v) > Po > 36/(19\/2) we find that one of the points vq or u> does not belong to 
Vor(v). 

Consider a point x G T v fl d Vor(u). There exists vertex id, 6 V such that x G Vor(w*) 
and is an edge of T. Therefore [v, w*] is a transverse edge of T such that x G 

Vor(v) nVor(w t ) PI T and p(v) > po- This implies according to Lemma 5.5.23 that 
is an edge of T containing x. 

Let T,T' G T be the two triangles containing the edge [u, w*], and let w,w' G V be 
the third vertex of these triangles respectively. Since T is Co-adapted to the metric H, we 



obtain using (5.10) 



\v — w* 



+ \v — w\ > diam(T) > 



\u 2 



> \\H(v)-*\\/C Q = \(v)/C . 



(5.145) 



If w G T„, then we obtain using Lemma 5.5.17 and Lemma 5.5.22 that 



\v — w\/(2fi(v)) < \v — w\ cos(<(9(v) ± , w — v))/(jl(v) < \\v — w\\h( v ) < 5, 



and \v — w*| < fi(v)/2. Injecting this in (5.145) we obtain u.(v)(l0 + 1/2) < X(v)/Cq 



hence p(v) < (10 + 1/2)Cq which is a contradiction. Therefore w T v , which implies 



<(8(v),w — v) < 7T/12 according to Lemma 5.5.24 Hence 



<{6(v),W - v) G [0, 7T/12] U [117T/12, 7T]. 

and likewise <(6 l (f),i(7 / — u) G [0,7T/12] U [117T/12, 7T]. On the other hand we have 
<(9(v) ± ,w* — v ) < 7T/3 according to Lemma 5.5.17, and therefore <(6 , (f),w* — v) G 



[7T/6, 57T/6]. Since the triangles T and T" are on opposite sides of the edge [i>,w*] we 
obtain ( |5.144 ) which concludes the proof of Point 1. 

- z\/\(v) < \\v - w\\ H (z) < 5. 



We now turn to the proof of Point 2. We have 



Assuming that c < c p (po, 5) we therefore obtain p(y) > po- If v' ^ r„, then Lemma 5.5.24 



gives the announced result. We therefore assume for contradiction that v' G r„, and we 
may assume without loss of generality that v' G T+. It follows that \ Vor(t>) ^ 0. 
Repeating our previous argument we consider a point x G fl<9 Vor(t>), and we find that 
there exists a vertex w* G such that [v, it)*] is an edge common to T and T containing 
x. We denote by T, T' the two triangles containing this edge and by w, w' the third vertex 
of these triangles. 

Since v' G we have <(6 l (f)- L ,f — v') < 7T/3 according to Lemma 5.5.17, hence 
<(6*(f),i/ — v) G [tt/6, 57T/6]. Since satisfy (5.144) we obtain that intersects 



one of the triangles T or T', which is a contradiction, as illustrated on Figure 5.10 (right). 
This concludes the proof of this lemma. 
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For each v E Vo such that p(v) > 2 + p we define T' v := [v,W-] U [u, where 
u>_, w + E Vo are such that r„ := [v, (2u>_ + v)/3] U [v, (2w + + v)/3\. We denote by T' the 
union of these sets 

r':= U C 



p(v)>2+p 



which is a union of segments joining some vertices in Vo, hence in V according to Lemma 
5.5.19 We denote by Q the partition into convex polygons of IR 2 obtained by bisecting 



the triangles T e T by these segments. Such a partition is generally referred to as the 
overlay T and T'. 

Proposition 5.5.26. The following holds if c is sufficiently small. Denote by T' the 
triangulation obtained by arbitrarily triangulating the convex cells of Q. Then T' is a 
^-refinement of T and S(T) < max{C 2 (3 + p ), tan(ll7r/24)} for all T E V. 

Proof: Our first objective is to give a uniform bound on the measure of sliverness in 
T' i and for that purpose we denote by V the collection of vertices of T 7 . We have by 
construction VcTU V . 

Assuming that c < c p (p + 1, 1), we obtain that p(z) > p + 1 for all z E V. 

Consider a vertex z EV', and assume in a first time that z E V . It follows from Lemma 



5.5.17 that there exists two edges [z,v], [z,v'\ of T', contained in T' } and such that 
<(6{z) L ,v-z) <7T/3 and <(^(z) ± , z - v') < 7T/3. 



Furthermore, it follows from Lemma 5.5.25 that there exists two edges [z,u>], [z, w'], of 
T' such that 

<{6{v),w - z) < 7T/12 and <{9{v), z - w') < 7T/12. 

In more detail two cases are possible : either z is a vertex of the original triangulation T, 
and Point 1. of Lemma 5.5.25 applies, or z is a vertex of T' created by the overlay of V 
and T. In that case Point 2. of Lemma 5.5.25 applies and the edges [z,w], [z,w'], of T' 
are contained in the same aligned edge of T . 
We thus have 

<{w-z,v-z) < <{w - z,6(z)) + <(0(z),6(z) r ) + <(e{z)^,v - z) 
< 7T/12 + 7r/2 + 7r/3 
= 117T/12. 

Likewise 



<(v-z,w'-z) < 117T/12, <(w'-z,v'-z) < 117T/12 and <(i/-z,w-z) < 117T/12, 

which immediately implies that any angle at the vertex z is bounded by 117T/12. 

In a second time we consider a vertex z E V \ V . Since V C T U T' there exists 
v E Vo such that z E T v . If p(z) > Po + 3 and c < c p (p + 3, 1) then p{y) > p + 2 and 
therefore z E V, which is a contradiction. Therefore p(z) < Po + 3, which implies that the 



272 



Chapter 5. Are riemannian metrics equivalent to simplicial meshes ? 



triangles T e T containing z satisfy S(T) < p(T) < Cqp(z) < Cq(po + 3). Any angle 6 
at the vertex z in the triangulation T, hence also in the triangulation T', thus satisfies 
tan(#/2) < Cq(Po + 3), which concludes the proof of the upper bound of the measure of 
sliverness 5 on T'. 

We now prove that T' is a bounded refinement of T, and for that purpose we consider 
an edge e = [v,w] of T, and we denote z :— (v + w)/2. We have by construction \v — 
iv\/X(z) < \\v— w\\ H (z) < 5. If a triangle T e % intersects [v , w], then for any z' G [v,w](~)T 

\{z)(l - 5c/2) < X(z) - c\z - z'\ < X(z') < \{z) + c\z - z'\ < X(z)(l + 5c/2). 

Recalling that 2\(z')/3 < diam(T) < X(z') we obtain 

T C B(z, 5X(z)/2 + X(z')) C B(z, X(z)(5/2 + 1 + c5/2)), 

and in the other hand since T is a half square 

y^2}{f\ = diam(T) > 2A(V)/3 > 2X{z)(l - 5c/2)/3. 

Comparing the areas we obtain that the number of triangles T <E % intersecting [v,w] is 
bounded by 

/ 5/2 + l + 5c/2 \ 2 
77 V 2(1 - 5c/2)/3 J 

which is smaller than 100 if c is sufficiently small. 

It follows that any edge [v,w] of T is cut in at most 100 parts by the set T. For any 
T E T we thus obtain #(<9T D V) < 3 x 100 = 300. Since any conforming triangulation 
of the convex envelope of n + 2 points uses n triangles, we obtain that the triangulation 
T' is a 298 refinement of T which concludes the proof of this proposition. o 
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6.1 Introduction 

The main purpose of adaptive mesh generation, compared uniform mesh generation, is 
to reduce the number of simplices required to achieve a given task. Here are two relevant 
examples which are further discussed in this chapter : 
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1. Given two domains of interest, generate the mesh with smallest possible cardinality 
that separates these domains with a layer of simplices. 

2. Given a function /, generate the mesh of a prescribed cardinality N that yields the 
best finite element approximation to / of a given order m — 1 in some prescribed 
norm LP or W 1,p . 

Both examples can be viewed as optimization problems posed on the set of meshes. The 
second problem was dealt with in Chapter 2 and 3 in the case where / is smooth, and 
an optimal mesh was constructed in the asymptotic regime N — > +oo. The results of the 
previous chapter have revealed that certain relevant classes of meshes can be equivalently 
described in terms of riemannian metrics, and a natural objective is therefore to translate 
the above problems as optimization problems posed on the set of metrics. Optimization 
problems posed on metrics may indeed be easier to solve practically due to the continuous 
nature of these objects. The goal of this chapter is to explain how this objective can be 
met for the above two examples. The dimension d > 2 is fixed throughout this chapter, we 
denote by T the collection of conforming simplicial meshes of R d and by H := C°(IR d , S£) 
the collection of metrics. 

In the previous chapter, we have introduced the classes of meshes 

Tj,C C T ai c C Tg,C, 

and characterized the respectively equivalent classes of metrics 

Hj C H a C Hg. 

These classes of meshes and metrics are defined on the entire unbounded domain M. d . 
The above optimization problems do not make sense in this setting since the number 
of elements in such meshes is infinite. In order to circumvent this difficulty while still 
avoiding the difficulties related to the boundary of a domain, we work in a periodized 
setting that we describe below. 

6.1.1 Periodic meshes and metrics 

We denote by T per the collection of meshes T G T which satisfy the following proper- 
ties : 

i) (Translation invariance) For any T G T and any u G 7L d , one has u + T G 7L d . 

ii) (Constrained diameters) For any T G T one has %t > Id. 

Accordingly, we denote by H per the collection of metrics H G H which satisfy the 
following properties : 

i) (Translation invariance) For any zeR 1 * and any u G 7L d , one has H(z) = H(z + u). 

ii) (Constrained positivity) For any z G R d one has H(z) > Id. 

A mesh T G T pcr can be regarded, thanks to the translation invariance property, as a 
mesh of the compact periodic space 



U d := (R/7L) d , 



(6.1) 
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which is also called the <i-dimensional torus. The diameter constraint property ensures 
that the simplices T of a periodic mesh T E T pcr do not have a significantly larger size 
than the fundamental cell [0, l] d of the standard 7L d periodic tiling of lR d . 

Strictly speaking, a periodic mesh T E T per has an infinite cardinality. However it 
only contains a finite number of simplices up to translation by an element of TL d . For any 
T G T pcr , we therefore denote by #(T) the number of equivalence classes in T for the 
relation 

T ~ T' if and only if there exists u G 7L d such that T' = T + u. 

The number #(T) is finite, and is the the cardinality of T seen as a mesh of the compact 
periodic space (6.1). We define the mass m(H) of a periodic metric H G H per as follows 



m(H) := [ Vdet H. 
J\o.i\ d 



[0,1]° 

The next proposition shows that the mass of a metric H and the cardinality of a mesh 
T are close when H and T are equivalent in the sense studied in the previous chapter. 
We denote by T eq a fixed equilateral simplex centered at the origin of M. d and having its 
vertices on the unit sphere. 

Proposition 6.1.1. The cardinality of a mesh T G T per and the mass of a metric H G 
Hp 0r satisfy the following properties : 

- IfU T < C 2 H{z) for allT eT and all zeT, then |T cq |#(T) < C d m{H). 

- IfH(z) < C 2 H T for allT ET and all zET, then m(H) < C d \T eq \#(T). 



In particular, if T is C -adapted to H in the sense of Definition 5.1.1 , then 

C- d m{H) < |T cq | #(T) < C d m{H). (6.2) 
Proof: For any z E 1R we denote by T(z) a simplex T G T containing z. Recalling (see 



Proposition 5.1.3 in the previous chapter) that for any simplex T 

\T\^detn T = |T e , 



-eqh 



we obtain 



#(T) = / = ^ / JdetH T{z) dz. (6.3) 



[o,i] d \T(z)\ |T eq | 7[ 01 ] 



Furthermore we have det 5* < det 5" for any S, S' E S£ satisfying S < S'. Applying this 
remark to the matrices %t{£\ an d C 2 H(z) or C~ 2 H(z) we obtain the announced inequa- 
lities, which concludes the proof of the proposition. o 

The equivalence between classes of meshes and metrics of lR d established in Theorem 



5.1.14 of the previous chapter has an immediate generalization to periodic meshes and 



metrics. 



Theorem 6.1.2. There exists a constant Co = Co(d) such that for all C > Co, 

1. The collection of meshes Tj c H T pcr is equivalent to the collection of metrics Hj D 



Hpor ■ 
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2. If d — 2, then the collection of meshes T a ,c H T pcr is equivalent to the collection of 



metrics H a D H per . 



3. If d = 2, then the collection of meshes T 9t c H T pcr is equivalent to the collection of 



metrics H g D H per . 



Proof: The proof of Theorem 5.1.14 presented in the previous chapter can be adapted to 
periodic metrics and triangulations with only slight changes, which are left to the reader, o 



6.1.2 Compactness of metrics 

In order to ensure the existence of an "optimal metric" for a number of optimiza- 
tion problems posed on periodic and isotropic, quasi-acute or graded metrics, we need a 
property of compactness for such metrics. 

We equip the set H per of continuous and 7L d periodic metrics with a distance d pev 
which is defined as follows : for all H, H' G H per 

d pcv (H,H'):= sup d x (H(z),H(z)) = sup d x (H(z),H'(z)). 

ze[o,i] d zeR d 

Consider H, H' G H per and define 5 = d pei (H, H'). We thus have for all z G M d , 

e~ 2S H(z) < H'(z) < e 25 H(z). (6.4) 

It immediately follows that 

e- dS m(H) < m(H') < e d5 m(H), (6.5) 

and 

e~ s d H (x,y) < d H >(x,y) < e 5 d H (x,y). (6.6) 

which shows that the mass m i— > m(H) and the distance between to fixed points H i— > 
dn{x,y) depend continuously on H G H pcr . 

We establish in §6.2 the following compactness property : for any M > 1 and any 
symbol * G {i, a,g} the collection of metrics 

{H G H pcr n H, ; m(H) < M} 

is a compact subset of H pcr . 



6.1.3 Separation of two domains 

Isotropic and anisotropic mesh adaptation is often used to separate some geometric 
sets, in the sense given by the following definition. For any closed subset E C M d and any 
mesh TeTwe define the neighborhood Vj-(E) of E in T as follows : 

V T (E):= |J T. 
TeT, 
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We say that a mesh TgT separates two closed sets X, Y C IR d if and only if 

v r (x) n VVQO = 0, 

in other words if X and Y have disjoint neighborhoods in the mesh T. 

A natural objective is to build a mesh of minimal cardinality which separates two 
given regions. Consider two closed, disjoint and 2Z d -periodic sets X,Y C M. d . We consider 



in §6.3| the optimization problem posed on meshes 

m^CoiX, Y) := min{#(T) ; T G T pcr fl T* jC < and T separates X and Y} 

where * G {2,0,(7} is a symbol and Co > 1 is a constant. We establish that this problem 
is equivalent to the following optimization problem posed on metrics, if the dimension is 
d = 2 : 

m*(A, Y) := mm{m(H) ; G H pcr fl H* and H separates X and Y}, 

where we say that a metric H separates X and Y if 

di{{x,y) > 1 for all (x,y) G X x Y. 

Precisely, we show that there exists a constant C = C(C ) > 1, independent of the sets 
X,Y, and such that 

C-V(X,F) < m, iCo (X,F) < Cm,(X,F). 

Imposing more constraints on the triangulation typically raises the number of triangles 
required to achieve a given task, and likewise assuming more regularity of the metric 
raises the mass required for a given task. We illustrate this remark in the context of 
the separation of domains as follows. We fix the parameter r := 1/4 and we define the 
periodic set 

X :={x &R d ; d(x,7Z d ) < r }, 
where d(x, E) := min{|x — e| ; e G E}, and for any < 5 < 

Y s := {yeR d ; d(y,7Z d ) > r + 5}. 



The sets X and Yg are illustrated on Figure 6^ The following equivalences are established 
as 5 —> 

m t (X,Y s ) ~ 5-^, 
m a (X,Y s ) ~ r^|ln5|, 
m g (X,Y s ) ~ 



A construction closely related to the separation of the sets X and Y$ is presented on 
Figure ffl in the main introduction of this thesis. 
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Figure 6.1 - The sets X (light gray) and Y s (dark gray), with d = 2 and 5 = 0.1. 

6.1.4 Approximation in a given norm 

One of the key purposes of anisotropic mesh generation is the adaptive approximation 
of functions, as discussed extensively in Chapters 2 and 3. In order to study this problem 
from the point of view of metrics, we introduce an error e#(/)p associated to a function 
and a metric, and we compare this quantity with the approximation error of / on a mesh. 

For that purpose we denote by Bjj(z) the open ball of radius 1 around z for the norm 
|| • \\h(z) (therefore an ellipse) : 

B H (z) := {z + u; \\u\\ H { z ) < 1}- 

For any exponent p, 1 < p < oo, and any / G Lf oc (IR d ) we denote by e# (/; z) p the error 
of best approximation of / on B H (z) : 

e H (f;z) p := inf ||/ - h\\ L p{b h {z))- (6.7) 

We denote by L^ er the collection of functions / G Lf oc (IR d ) which are 2Z d -periodic. For all 
/ G L^ cr and all H G H per we define the approximation error en{f)p of / associated to 
the metric H as follows 



e H (fY p := / y/det H(z) e H (f; zfdz. (6.f 
J\o,i] d 



'[0,1] 

Consider a metric H G H pcr D H g and a mesh T G T per such that 

H T > 4: 2 H(z), zeT,T G T, (6.9) 

which heuristically means that that the mesh T is "more refined" than the metric H. We 
establish that for any / G L^ er 

inf \\f-g\\ p <Ce H (f) p (6.10) 

geVm-i(T) 

where Vfc(T) stands for the space of finite elements of degree k on T, and C is an absolute 
constant. Furthermore we give an explicit expression of a finite element approximation 



g G VkiT) which satisfies (6.10), namely 



<?:=ir/*', (6.11) 
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where H' := 4 2 H. We denote by fn the convolution distorted by the metric if of a function 
/ with a fixed compactly supported mollifier ip (which satisfies a moments condition) 



JR d 



'■u)(p(\u\)du. 



The extension of the approximation result (6.10) to the W 1,p semi-norm is not trivial 



due to the fact that interpolation on simplices of large measure of sliverness is not a stable 
procedure. 

For any vector field v G L^ oc (M. d , Mr) and any metric if G H we introduce the two 
quantities 



e H {v : z) p 



mm \\v - v\\lv{b h {z)) 



and 



Note that 



e 9 H (v; z) p := \\H(z)\\* min \\H(z) Hv - u)\\ lp{Bh(z)) . 



e a H (v; z) p < e 9 H (v; z) t 



(6.12) 
(6.13) 
(6.14) 



with equality if H (z) is proportionnal to Id. If v is 7L d periodic and if H G H per , then we 
define e a H {y) p and e 9 H (v) p similarly to (6.8). 



Consider a metric H G H pcr nH a and mesh T G T pcr which satisfy (6.9). We establish 
that for any / G 

(6.15) 



inf || V(/ -g)\\ p < Ce a H (yf) p max^(T), 
where C = C(d). 

This estimate should be compared with the following. Consider a metric H G H pcr nH s 



and a mesh T G T which satisfy (6.9) and in addition CqH(z) > T-Lt for all T G T and 
z G T. We establish that 



inf 

g eV m -i(T) 



\\V{f-g)\\ P <Cc a H {Vf) 



V 



(6.16) 



where C = C{C ,d). 



The function g given by (6.11) satisfies both estimates (6.15) and (6.16). These results 



put in light the price to pay to obtain the best error estimate (6.15) in the W 1,p norm : 



the measure of sliverness should be uniformly bounded on T, and the metric H should 
belong to H a . If these conditions are not satisfied then we can only prove the estimate 
(6.16), which penalizes the anisotropy of the metric H as observed in (6.14). Note that 



according to Theorem 6.1.2, if the dimension is d 
there exists a mesh 7o G T a> c D T 



2 then for any metric H G H a D H pcr 
pcr which is C'-equivalent to H, where C and C are 



absolute constants. By definition of quasi-acute meshes there exists a C-refinement T of 



7o on which the measure of sliverness is uniformly bounded by C, which satisfies (6.9), 
and such that #(T) < C"m(H) where C" is an absolute constant. 
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6.1.5 Asymptotic approximation and explicit metrics 

We establish some counterparts for metrics of the asymptotic estimates presented in 
Chapters 2 and 3 for triangulations and more general meshes. Our estimates hold without 
restriction on the degree m — 1 of interpolation, m > 2, or the dimension d > 2. They 
apply to a sufficiently smooth function and to metrics of asymptotically large mass. 

We denote by H m the vector space of all homogeneous polynomials of degree m in d 
variables, equipped with the norm 

||7r|| := sup |7r(u)|. 
M<i 

We define the shape function K as follows : for all 7r G H m 

K(tt) = inf ||tto A\\. (6.17) 

A&SL d 



We establish that for any 7L d periodic and C m function 



hmsup M? inf e H (f) p | < C\\K((Tf)\\ T , (6.18) 

M->oo \ WeH s 
\ m(H)<M 

where the exponent r is defined by ^ := ^ + K 

The proof of this result would be greatly simplified if one could establish the existence 
of a smooth map tc G H m !->■ A(ir) G SL^ such that K(ir) = \\n o A(7r)||. This is unfortu- 
nately not the case, and the key ingredient of the proof of ( 6.18[ ) is the definition and the 



analysis of a family of functions which tend to K as a — > oo and which are defined by 
a well posed optimization problem (for which there exists a unique minimizer depending 



continuously on the parameter it). We also prove in £6.5.5 that the shape function K is 
uniformly equivalent to a continuous function on H m , as well as the shape functions L g 



and L g defined below. 



In the case of W l,p norms we introduce two distinct shape functions 

L a (n) := inf \\(Vtt) o A\\, 

AfcbL d 

L g (n) := inf || V(vr o A)\\, 



where H m _ 1 is equipped with the norm 



sup \v(u) 
H=i 



We establish similar results to (6.18), for any C m and TL periodic function / 



(6.19) 
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and 



limsup M^r- in f e 9 H {V f) p \ < C\\L 9 {d m f)\\ T 

M-S-OO \ fl ™9 

m(H)<M 



(6.20) 



where the exponent r is defined by - := 



m— 1 I 1 



For a quadratic or cubic (in two dimensions) homogeneous polynomial ir, we give an 
explicit expression of near-minimizers of the optimization problems defining L a (7r) and 
Lg(7i) in terms of the coefficients of tt. This expression can be used to compute efficiently 
in numerical applications a metric H adapted to the approximated function / using the 
available information on the derivatives of /. 

We also use these minimizers to compute the ratio L a {n) / L g {ii) < 1, and we discuss 
for which polynomials this ratio is significantly small. In other words for which types of 
anisotropic features of the approximated function / the use of quasi-acute triangulations 
leads to a significantly smaller approximation error than the use of graded anisotropic 
triangulations (which is the case in all numerical software known to the author). 



6.2 Compactness of metrics of a given mass 

The purpose of this section is to establish the following compactness result. We recall 
that the collection H per of 7L d periodic metrics is equipped with the distance 

d pei (H,H') := sup d x (H(z),H'(z)). 

Theorem 6.2.1. For any symbolic £ {i,a,g} and any mo > 1 the collection of metrics 

{H £ H pcr n H, ; m(H) < m } (6.21) 

is a compact subset o/H per 

The rest of this section is devoted to the proof of this theorem. Our first intermediate 



result establishes that the set (6.21) is closed in H 



"■per- 



Lemma 6.2.2. The set (6.21) is closed in H per; as well as the sets Hj,H a ,H 



Proof: As observed in \ 6.1.2, the pointwise evaluation H \— > H(z) at a fixed point z G Mr, 
the distance H ^ dn{x,y) between two fixed points x,y £ lR d and the mass H i-> m(H) 



define continuous maps on H per , see (6.4), (6.5) and (6.5). For any fixed x, y £ K , the 



collection of all metrics H £ H pcr which satisfy one of the following conditions 

d x (H(x),H(y)) < d H {x,y), 
d+(H{x),H{y)) < \x-y\, 



\H(x)\\\\H(x) 



m(H) < m , 
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is therefore a closed subset of H per . 



A metric H belongs to H 9 (resp. H a , resp. Hi) if and only if it satisfies the first of 
these inequalities for all x,y G lR d (resp. the two first, resp. the three first). The sets Hj, 
H a and H g are thus an intersection of closed subsets of H per , hence are closed. The set 



(6.21) is obtained by imposing the fourth inequality, and is therefore also closed. o 



We recall a classical result which states that the collection of Lipschitz functions bet- 
ween two compact metric spaces is itself a compact metric space. It is an immediate 
consequence of Ascoli theorem. 

Theorem 6.2.3. If two metric spaces (X, dx) and (Y, dy) are compact, then the collection 
Lip(X, Y) of Lipschitz functions from X to Y is also compact when equipped with the 
distance 

d(f,g) ■= maxd Y (f(x),g(x)). 

We use this theorem to establish that closed and bounded subsets of H per fl H 9 are 
compact. 

Lemma 6.2.4. Any closed and bounded subset H* o/H por fl H g is compact. 

Proof: Since the collection H* of metrics is bounded, the following quantity M is finite 

M 2 := sup sup 
HeH, ze[o,i] d 

For any x, y G M. d and any H G H* we thus have djj(x, y) < M\x — y\. 

We consider the compact metric space X := [0, l] d , equipped with the distance 
dx(x,y) := M\x — y\, and the compact metric space 

Y := {M G Sj ; Id < M < C 2 Id} 



equipped with the distance d x . According to Theorem 6.2.3 the metric space Lip(X, Y) 
is thus compact. 

Consider a sequence (H n ) n > of elements of H*. For all n > 0, the restriction H n ^ 01 ^d 
is an element of Lip(X, Y). Since this space is compact there exists a sub-sequence H<p( n ) 
which converges uniformly on [0, l] d . Recalling that H n is periodic for all n > we find 
that ii/^jn) converges uniformly to a periodic metric H G H per . 

Since H* is closed in H per fl H ff , and since H pcr fl H g is closed in H per , we obtain that 
H* is closed in H per and therefore H G which concludes the proof. 



The next lemma establishes that the set (6.21 ) appearing in Theorem 6.2.1 is bounded. 



This set is closed according to Lemma 6.2.2 hence compact according to Lemma 6.2.4 



which concludes the proof of Theorem 6.2.1 



Lemma 6.2.5. Let m > 1 be a constant. 
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1. There exists C = C(mo, d) such that for any H G H g satisfying H(z) > Id for all \z\ < 
1 and 

/ \/ det H{z)dz < m 

J\z\<l 

we have \\H(0)\\ < C 2 . 

2. The set {H G H per fl H 9 ; m(H) < m } is bounded. 

Proof: We first show how 2. can be obtained assuming 1., and for that purpose we 
consider H G H pcr fl H g and z*. The ball {z G lR d ; \z — z*| < 1} is covered by the 2 d 
cubes z* + u + [0, l] d where u G { — 1, 0} d . Consequently 



/ y/det H(z)dz < Yl [ VdetH = 2 L 



m 



and we already know that H(z) > Id for all z G M d . It follows that ||i?(2:*)|| < C' 2 where 
C is the constant associated to 2 d mo in Point 1., which concludes the proof of Point 2. 

We now turn to the proof of Point 1. At each point z G lR d we denote the eigenvalues 
of H(z) by 

Xl(z)>...>X 2 (z). 

For each k G {1, • • • ,d} the function : M. d — > M.* + is continuous and X(z) > 1 for all 
\z\ < 1. 

Our purpose is to obtain an upper bound for Ai(0) = \/\\H (0) || in terms of m . We 
shall proceed as follows : we first give an upper bound for Xd(0) that depends only on m , 
and then for each 1 < k < d — lwe give an upper bound for Afc(0) in terms of mo and 
A*+i(0). 

The upper bound on A^(0). We define the ellipse 

£:={zeR d ; \\z\\ H{0) < X d (0)} 



and we observe that \z\ < 1 for all z E £. We thus obtain using Corollary 5.2.11 that 
c d lnA d (0)< / a/ 'det H(z)dz < / ^det H(z)dz < m , 

where Cd > is a constant which depends only on the dimension d. This gives as announced 
an upper bound on Xd(0) in terms of mo- 

The upper bound on A/^O). The vector space lR d is the orthogonal sum 

R d = u © V 

where U is the sum of the eigenspaces associated to the eigenvalues Af (0), • • • , A^(0) of 
H(0) and V is associated to the other eigenvalues A^ +1 (0), ■ ■ ■ , A^(0) of H(0). We have 
for all u G U and all v G V, 

\\u\\h(o) > A fc (0)|tt| and ||u||h(o) < X k+1 (0)\v\. 
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We consider an isometry P G Od,k (in the sense that P T P is the k x k identity matrix) 
satisfying P{R k ) = U . For each v G V we define a metric H v on IR fc as follows : for all 



u e 



H v (u) := P T H v (v + Pu)P. 



It follows from Proposition 5.2.2| (in the previous chapter) that H v belongs to the collection 
H 9 (IR fc ) of graded metrics onr. We define 

B v := {u G R k ; \u\ < 1/2} and B v := {v G V ; X k+1 (0)\v\ < 1/2}, 

as we observe that 

\Pu + v\<\ for all (u, v) G B v x By. (6.22) 
Recalling that \{z) > 1 for all 1 < i < d and \z\ < 1, we obtain for all (u, v) G Bu x .By 

det H v (u) < \\{Pu + u) • • • X 2 k (Pu + v) 
< \\{Pu + v) ■ ■ ■ \ 2 d (Pu + v) 
= det H(Pu + v). 

For any u G lR fc and any «6l^we have according to (5.53) 



(6.23) 



MI-ff„(o) = \\Pu\\h{v) > (1 - IMI#(o))||-Pw||h( 2 *) > A fc (0) |w|/2, 



(6.24) 



and therefore 



{u G 



«lk(o) < A fc (0)/4} C B v . 



Denoting by c k the constant from Corollary 5.2.11 applied in dimension k, we therefore 
obtain for any v G By 



V 'det H v {u)du > c fc m(A fe (0)/4). 



(6.25) 



We thus obtain successively 

/ y/ det H(z)dz > / y/det H(Pu + v)dudv 

J\z\<l Jv£B v Ju&Bu 



> 



> 



a/ det H v (u)dudv 



VGBy J U^Bjj 

c k \n(\ k (0)/4)dv 

veB v 

B v \c k \n{X k (0)/4) 



(2A fc+1 (0))^ 



c fc m(A fc (0)/4), 



where we used (6.22) and the Fubini integration formula in the first line, (6.23) in the 
second line, and (6.25) in the third line. In the last line we denote by u)d-k the volume of 
the unit euclidean ball in 

This inequality yields an upper bound on A&(0) in terms of M and Afc+i(0) 

f m (2A fc+1 (0)) rf - ' 
Afe(O) < 4exp 

which concludes the proof of this theorem. 
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6.3 Separation of two domains 

We study in this section a geometrical problem : the separation of two sets using 



a mesh of minimal cardinality. Our first result, Proposition 6.3.1 establishes that this 
problem has an equivalent formulation in terms of metrics. 

Proposition 6.3.1. We assume that the collection of meshes T pcr nT i]C[) and the collec- 
tion of metrics H per fl H* are equivalent in dimension d, where * G {a, b, c} is a symbol 



and Co > 1 is a constant (According to Theorem 6.1.2, this condition holds at least if 
d = 2 and Co is sufficiently large). 

There exists a constant C > 1 such that the following holds : 
Let 1,7 ci d be two closed, disjoint and periodic sets, in the sense that 

x + u G X and y + u EY, for any x £ X, y E Y and u G 7L d . 

Consider the two optimization problems 

m* 5 c*o(^5 Y) := min{#(T) ; T G T per fl T*,c and T separates X and Y}, 
m*(X, Y) := mm{m(H) ; H G H per D H* and H separates Y and Y}. 

Then 

C- l m^X,Y) < m* iCo (X,Y) < Cm+(X,Y). (6.26) 

Proof: We denote by C\ > 1 a constant such that for any T G T per fl T± tCo there exists 
H G H pcr fl H* which is Ci-equivalent to T, and vice versa. 

The proof of this proposition is a simple translation between the vocabulary of meshes 
and the vocabulary of metrics. We first establish that m+(X, Y) < Cm^c (X,y), and 
then that m^ Co (X,Y) < Cm*(X,Y). 

Let T G TpcrflT^Co be a mesh which separates the sets X and Y, and let H G HperflH* 
be a metric which is C\ equivalent to T . We thus have for any T G T and any z G T 

Ci 2 H(z) <U T < C\E{z). 

Let x G X and let y G Y. Let T G T be the simplex containing x and let Vj-(T ) be the 
neighborhood of T in T. Since x G T and y £ Vj-(T ) we have \\x— y\\u To — c := (Cov^f) -1 



according to Proposition 5.4.5 (in the previous chapter). It follows that 

d H (x,y) > ln(l + \\x - y\\ H (x)) > hi(l + \\x - y\\n To / c i) >r := ln(l + cCf 1 ). 



The metric H := r 2 H thus belongs to H per fl since < r < 1, see Remark 



5.1.13 



-2 

Furthermore this metric separates the sets X and Y and satisfies according to Proposition 

m(H) < (CVrf|T eq |#(T). 
Taking the infimum over all meshes T G T*,c which separate X and Y we obtain the left 



part of (6.26) : m*{X,Y) < {T^Kd/r^ m^ C() (X,Y). 
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We now turn to the proof of the right part of (6.26), and for that purpose we consider 
a Riemannian metric H G H* fl H per which separates the sets X and Y. We consider a 
parameter A > 1, which value will be specified later, and we observe that H := X 2 H G 
H* n H por . Therefore there exists a mesh T G T*,c H T por such that for all T G T and all 
z G T one has 

C^ 2 H(z) <U T < C 2 H(z). 

Let us assume that T does not separate the sets X and Y, hence that Vr(X)r\Vr(Y) ^ 
0. In that case there exists two simplices T,T' G T sharing a vertex u, and two points 
x G T PI X and y G T' n y. For any simplex T and any z, z' £ T we have ||z — 2'||-h t < 2, 
see (5.9). Therefore 

A < dfr(x, y) < djj(x, v) + d^(v, y) < C x {\\x - v\\ %T + \\v - y\\u T ,) < AC X . 

We now choose the particular the value A := 4Ci + 1, which contradicts the previous 
equation and shows that T does separate X and Y. Furthermore 

|T eq |#(T) < Cfm(H) = (Ad) d m(if). 

Taking the infimum among all metrics H G H* which separate X and Y we obtain 
\Teq\m* t c (X, Y) < (XCiYm^^X, Y), which concludes the proof of this proposition. o 

As a concrete example, we define r := 1/4 and for any < 5 < r we consider in the 
following the sets 

X := {x G R d ; d(x,7L d ) < r } and Y s := {y e R d ; d(y,7Z d ) > r + 5}, (6.27) 



which are illustrated on Figure 6^ The next theorem gives a sharp estimate of the minimal 
mass of a metric which separates the sets X and Y$, as 5 — > 0. The notation as — 0s 
stands for the following : there exists a constant C > 1 such that for all < 5 < r 

C~ l a 5 </3 s < Ca s . 

Theorem 6.3.2. We have the equivalences 

mi (X,Y s ) ~ 
m a (X,Y 5 ) ~ r^|ln(J| 
m,(X,F 5 ) ~ 5- 



2 



Furthermore for any * G {i, a, g>} and any < 5 < ro tae explicit metric H$ G H* fl H per 
defined below is a near minimizer of the problem m*(X, Yg). In other words H$ separates 
X and Yg and satisfies m(Hf) ~ m^X, Yg). 

The proof is divided in two parts : we first give the explicit expression of the metrics 
H$ and we use it to obtain an upper bound on m*(X, Yg). We then prove a lower bound 
on m±{X,Y 5 ). 



6.3. Separation of two domains 
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6.3.1 Explicit metrics and upper bound 

We define in this section the metrics H$, where * G {i,a,g} and < 5 < r := 1/4. 
We then establish in Lemma 6.3.3 that H$ G H*, in Lemma 6.3.4 that H$ separates X 
and Ys, and we prove in Lemma 6.3.5 an upper estimate on the masses m{Hf) of these 
metrics. 

In order to define H$ we need to introduce some notations. For any z G W 1 \ {0} we 
define 

s(z) := min{r , \ \z\ — r |} and 9(z) := —. (6.28) 



For any < 5 < r and any z G M d \ {0} we define three matrices Sg(z) G S^, where 
* G {i, a, g}, as follows 



Sl(z) 



niax{s(z), 5} Id 

max{s(z), 8} 9(z)9(z) T + max |s(z), v^} (ld-9(z)9{z) T ) 
max{s{z),5}9(z)9(z) T + y/r max{s(z), 6} (Id -9(z)9(zf 



(6.29) 



We also define S$(0) = tq Id for any * G {i, a, g}. Eventually we define the metrics H$ by 
the equality 



2fl?(z + «)-s :=S^) 
for all ^ G [— |, and all u G 5Z d . Observe that 



Id for all z G [-1/2, l/2] d such that \z\ > 2r = 1/2, 



(6.30) 



(6.31) 



since Sg(z) = r Id on this set. Figure [8] in the main introduction of this thesis illustrates 
some (finite) triangulations which are respectively equivalent to the metrics Hj-, * G 
{i,a,g}. 

Lemma 6.3.3. For all < 5 < r$ and all * G {i, a, g} one has H$ G H per PI H*. 



.1 l]d) 
2 ' 2 J /' 



and therefore Ht = Id 



Proof: We first remark that s = ro on the boundary d([- 
on this set for any * G {i,a,g}. Using (6.30) we thus obtain that H$ is continuous and 
TL d periodic. 
We denote 

Sper(z) := min{r , \d(z, 7L d ) - r |} 

and we observe that s per is a Lipschitz function. Since H l & = 4 Id / max{s per , S} 2 , we obtain 
that FL\ G Hj (because z h-> mm{s(z), 8}/2 is Lipschitz), which concludes the case * = i. 



We now consider the case * = g, and for that purpose we use the notations of §5.3| 
(in the previous chapter). Our first step is to establish that the local dilatation di\ z (H) x 
defined in (5.58) is smaller than 1 for all z in the set 

Q:={z eR d ; < \z\ < r - 5}. 



We define the functions 



A := s/2 and ji := a/s/4 
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and we observe that H% 
and we have 



\- 2 ee T 

VA = 



/j, 99 on f2. The functions A and \i are C°° on f2 



-9 



and V/x 



-9 



Therefore for any zGO 



and 



dil^lnA; 



dil^ln^; 



dil,(A; d H ) \\V\{z) 



\H(z)- 



X(z) 



\{z) 
\\VKz)\\h( z )- 



-9(z)/2\\ H(zr 
X(z) 

X(z) 



1/2, 



1/4. 



K z ) K z ) 8^/s{z)fi{z) 

On the other hand the derivative of 9 at a point z6R d \ {0} in the direction a6R' i 
the component of u orthogonal to 9 and divided by \z\ : 



is 



89 
du 



(z) 



u- (u,9(z))9{z) 



therefore 

On the other hand for z G Q 

AO) 



diU0: d 



H(z) 



fi(z) X(z) 



r 



s\z 




< r — s(z) = \z\ 



where we used the fact that \/t — 1/ \/t < t for all t > 1 (which is easily checked by 
derivation). Therefore 



AO) fi(z) 



H{z) AO) 



dil,(0; d H ) < ^ < 1/* 



It follows that the quantity D z (a) x defined in (5.59) is smaller than 1/2, and therefore 
that di\ z (H$) x < 1 on Q according to Theorem 5.3.1 Proceeding likewise we also obtain 
that di\ z (H®) x < 1 on the annulus {z G lR d ; r + 5 < \z\ < 2r }. Furthermore iff is 
constant on the annulus {z G M. d ; r — 5 < \z\ < r 
(-1/2, l/2) d ; \z\ > 2r }. Defining 



5} and on the domain {z G 



T := {z G 



G {0, r - 5, r + 5, 2r }} U <9([-l/2, l/2] d ), 



; \r. 



and r := {z + u ; 2; G r , u G 2Z d } we have thus established that di\ z (Hg) x < 1 on 
Corollary 5.2.6 thus implies that H 9 & G H g . 

The proof that H% G H a is extremely similar to the proof that iff G H g : the quantities 
D z (a) x and D z (a) + defined in (5.59) and (5.60) can be explicitely computed on M d \r and 
are smaller that 1/2. Theorem 5.3.1 thus implies that the local dilatations dil z (Hf) x and 
dil 2 (i?5) + are bounded by 1 on IR d \ T, and it follows from Corollary 5.2.6 that Hf G H a . 
The details of the proof are left to the reader. o 



The next lemma establishes the that the metrics Hg do separate the sets of interest. 
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Lemma 6.3.4. For all < 5 < ro and all * G {i,a,g} the metric H$ separates the sets 
X and Yg. 

Proof: Let 7 G C\[0, l],R d ) be such that 7(0) G X and 7(1) G F 5 . Since the sets X, Y s 
are 2Z d -periodic, as well as the metrics we may assume that 7(0) G [-§, |] . Recalling 
the definition (6.27) of X and I5 we obtain 

|7(0)|<r and \j{l)\>r + 6. 

We define 

t := max{t G [0, 1] ; 7 (£) G X} and ti := min{t G [t , 1] ; lit) G 
For any t G [to, £1] we have 

7(*o) =r < \j(t)\ < r + 5 = 7 (t 1 ). 

and therefore 



7 WK*(7«) 



> 



2|<y(t),6)(7(t)))| _ 2|| 7 r(t)| 
5 5 



where | 7 | stands for the function z 1— >■ |7(^)|- Hence 



Wt) > / IIY(*)l|fl?( 7 ( t) )<ft> 



2|7l'(0* oW*i)|-|7(*o)| 



5 



5 



It follows that d,H*{X, Ys) > 2 which concludes the proof. 



o 



The next lemma estimates the mass of the metrics m(Hg), which yields an upper 
estimate on the minimal mass m*(X,Ys) required to separate the sets X and Ys using a 



metric in H pcr fl LL 



Lemma 6.3.5. There exists a constant C — C(d) > such that for all < 5 < r 



m(Hi) 


< 


cs -(d-D 


m(Hf) 


< 


C5-^\ 


m(H%) 


< 


C5-^. 



(6.32) 
(6.33) 
(6.34) 



Proof: Recalling the definition (6.30) of H$ in terms of S| we obtain 

dz 



m(H*) = 8 d c + 2 d [ 



\z\<2r det S$(z) 

where Cq stands for the volume of the following set, on which H$ = 8 2 Id according to 

(pil, 







1 


r 


d 




c : = 


{•« 


~2' 


2_ 




\z\ > 2r | 
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Furthermore 



/ 

J\z\ 



dz 



Cl 



2r _d-l 



r dr 



s; r 



where Cj stands for the d — 1- dimensional volume of the sphere {z 6 ; \z\ — 1}, and 
where 

sj(r) :=det5J(z) 

for any 2 such that |z| = r. Therefore there exists a constant Ci > 0, independent of 5, 
< 5 < r , and of * e {i, a,g}, and such that 

/■ 2r ° dr 



Recalling the definition (6.29) of Sg(z) we obtain s|(r) = max{<5, \tq — r|} for all < r < 



2ro, hence 



2r„ 



dr 



r=0 °<5 



si r 



r °- 5 dr 



!o ( r o - r) d 

= 2r (d - 1} + 2 



+ 



r +5 



ro-5 



dr 



+ 



2r 



ro+5 



(r - r ) c 



d- 1 d- 1 



which establishes (6.32). Similarly, we find sf(r) = r 2 max{<5, |r — r|} a 1 , hence 



d-l 



2r 



dr 



=o 4( r ) 



ro-5 



dr 



o (ro — r) d 2 1 



+ 



ro+5 dr r 2r df . 

+ 



•o-5 5 <i 2 1 y ro+1 5 (r - ro) 



rf+i 



d-l 

2<T~3~ +2 



,_. d-l 

2r— 2r 2 



d-l (d-l) 



d-l 



which establishes (6.34). Eventually, we have Sg{r) = max{<5, |r— r |} max <j y/S, \r — r | 
and therefore 



2r 



dr 



dr 



dr 



( r ) Ji s \ r o- r \ d J Js |r - r|<$V Ao-,; 5 



r +5 



dr 

. d+l ) 
2 



(6.35) 



where 



0, ro-v^l U fr + y/5, 2r 



and Js 



r 



- \/5, r - 5 U r + 5, r + VS 



Therefore 



2r ° dr 
r=o ^(r) 



d- 1 



75 



-(d-i) 



- r. 



-(d-l) 



•28~ V (in (v 7 ^) - In 5 



+2<T^ 



d-l 
2 
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which establishes (6.33). 



o 



6.3.2 Lower bound 



We prove in this subsection the lower bound announced in Theorem 6.3.2 



Lemma 6.3.6. There exists a constant c = c(d) > such that for all < 5 < r , 

m t (X,Y 6 ) > c5- {d - l \ (6.36) 
m a (X,Y s ) > c<T^|ln5|, (6.37) 
m g (X,Y s ) > c8-*r. (6.38) 

Proof: We consider a metric H G H*, where * G {i,a,g}, which separates the sets X 



and Ys. In particular H G H 9 . We denote by § := {v G 



1} the euclidean unit 



sphere of Mr, and by u the d — 1-dimensional volume of §. We define for all v G S 



n[v) 



\v\\ H {r v) and m(v) := inf = \\H(r v) * 

\w\=l 



Proposition 5.2.10 (in the previous chapter), states that for any z,u G M. d we have 

d H (z, z + u) < - ln(l - ||w||h-( z )), (6.39) 

where the right hand side equals oo by convention if ||u||if( z ) > 1. 

We now consider a point v G § and we observe that r Q v G X and (r + 8)v G Yg. 
Therefore 

1 < d H (r v, (r + S)v) < - In (l - 5||u||fr(ro«)) , 
hence X5n(v) > 1 where A := (1 — e -1 ) -1 . Let v,w G S, and let a G {—1, 1} be such that 
o~(v, w) > 0. Remarking that 2r + <5<3/4<lwe obtain 



r v 



+ ay/5- 



= r 2 + 5 + 2r \/So-{v, w) > r 2 Q + 5 > r 2 + (2r + 5)5 = (r + 5) 2 . 

Hence r v G X and r w + ay/Sw G I5. Therefore 

1 < d H (r v, r v + a\/5w) < - In ^1 - \/S\\w\\ H{rQV ^ , 

which implies X\/6m(v) > 1. We we now distinguish between the three cases * = i, a or 
9- 

- If * = i, then H(z) = Id/s(2;) 2 where s is a Lipschitz function, and s(rov) = 



n[v) 



< X5. It follows that 



m(H) > 



2r r d-l drdv 



S Jo 



s[rv 



> 



> 



ro+s r d - l drdv 



r d - l 5uj 
(X5 + 5) d 



{n{v)- 1 + 5) d 



(x + iy 
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where u denotes the d — 1 dimensional measure of S. Taking the infimum among all 
metrics H E Hj which separate the sets X and Yg, we obtain as announced 



m i (X,Y s )>c5- {d - 1) where c 



r u 



- If * = a, then the functions z i— > \\H(z)\\~5 as well as z i— > \\H(z)~^\\ are Lipschitz. 
Hence for any v E § and any /i6Rwe have 

||fl"((ro + /i)v)|rs < n(w)^ 1 + |//| < X6+\fJ,\, 
\\H((r + fj,)v)~*\\ < m(u)" 1 + |//| < A\/5+|/i|. 

Recalling that detM > ||M|| |] iWf _1 1| for any M E Sj we obtain 
(A<5 + | M |)(Av / 5+ | / u|) d - 1 v / deti/((r + / u)t;) > 1. 

It follows that 



m(H) > ^' 



''0 



(A5+|//|)(AV^+H) d - 



> w 



/n 



(A5 + ^)(A v / 5 + V6) d ~ l 

_ -(lnV5-ln(A5)). 
((A + l)^- 1 



Hence 



m a (X,Yg) > c5 2 (—In 5 — 2 In A) where c 



2(A + l)^ 1 

which concludes the proof of (6.37). 

We now consider the case * = g. We have for any v E S> 

^det if (r u) > . . -. ., 1 > ' 



n{v)m{v) d - 1 ~ n^iXVsy- 1 ' 
Using (5.54) (in the previous chapter) we therefore obtain for any /jg! 

y/detHdro + rfv) > (1 - Vdet #(r t;) > (1 " ^H*))* 

hence 

~ J$J-r niv^xVsy- 1 

r r^v) r $-\l/2) d dndv 

~ JsJo n(v)(A v / 5) d - 1 

= wt 1 
2 d (XV6) d - 1 ' 
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Figure 6.2 - Weak separation of two sets by a triangulation. 



Therefore 



m g (X, Yg) > cS 2 where c 



cur 



d-i 



2 d\d-i 



which concludes the proof of ( |6.38 ) and of this lemma 



o 



Remark 6.3.7 (Constrained triangulations). A variation of the problem considered in 
this section is often considered in the litterature. We say that a mesh T G T weakly 
separates two regions X, Y C M d if 

mt(V T (X)) n mt(V r (Y)) = 0, 

where 'mt(E) refers to the collection of interior points of E. 

A common approach for weakly separating two closed and disjoint sets 1,7 C M d , is 
based on constrained mesh generation. The first step is to build a polygonal hypersurface 
T such that for any path 7 G C°([0, 1], R d ) satisfying 7(0) G X and 7(1) G Y , there exists 
s G [0, 1] such that 7(5) G V . The second step is to build a mesh T of the domain Q which 
is constrained to contain the polygonal surface T in its skeleton UrerdT. 

The main advantage of weak separation is that it can be achieved efficiently without 
requiring anisotropic simplices, see Figure 6.£\ Heuristically the same number of simplices 



is required to separate the sets X and Y$ "weakly" with an isotropic triangulation, or 



"strongly" with a graded triangulation. We refer the interested reader to for a survey 
on constrained mesh generation. 



6.4 Approximation in a given norm 

We compare in this section the approximation error, measured in the LP norm or the 
W 1,p semi norm, of a function / by finite elements on a mesh T with the error associated 
to / and a metric H. In this section, as well as in the next one, we fix an integer m > 2 
and we consider finite elements of degree m — 1. 

The greek letter a always refers to a d-plet of non-negative integers 

a = Oi, • • • ,a d ) G 
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and we denote |a| =«! + ••• + oca- We define the monomial 

z° = n Z * 



7Q, 

J i 7 



Ki<t 



where the variable is Z = (Z±, Z d ), and we denote by the collection of all polyno- 
mials of degree at most k. 

P fc : = Vect{Z a ; |a| < k}. 

We introduce a fixed function ip G L°°(IR + ,M) supported in [0,1] and satisfying the 
following moments property : for all fi G P m _i one has 



fji(z)(p(\z\)dz = fi{0). (6.40) 



SR.' 1 



For any / G L[ oc (R ) and any H G H we denote by /# the convolution of / with ip(\ ■ |) 
distorted by the metric H, also referred to as the distorted mollification of / by H, and 
which is defined as follows : for all z G M, d 



f H (z) := / f(z + H(z)-2uM\u\)du (6.41) 
= ^detH(z) [ f(z + uM\\u\\ H(z) )du. (6.42) 

JTR d 

Note that fin = f° r an y polynomial /i G P m _i. Indeed we obtain using the moments 



property (6.40) that for all z G M d 

= + H(zy^0) = ii{z). 

The next proposition establishes that the distorted mollification / i->- f H is bounded 
in some functional spaces. For any metric H G H, any point z6K d and any radius r > 
we define 

B H (z, r) := {z + u; \\u\\ H(z) < r}, 

as well as 

B H (z) := B H (z, 1) and B(z, r) := B u (z, r) 
(the latter being the standard open euclidean ball). Note that 

\B H (z,r)\^detH(z) =cor d where w:=|S(0,l)|. (6.43) 



Lemma 6.4.1. i. (Boundedness from L\ oc to C°) For any H G H and any f G 



loc 



has fn G C°(lR d ). Furthermore for any z G 

< ^/detH(z)\\<p\\ L ~\\f\\ LHBH{z)) , (6.44) 
where B H (z) := {w G M d ; < 1}. 
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2. (Boundedness from W^ c to For any LL G H a and f G one has fu G 

Wfo™ . Furthermore for any z G M. d 

dH z (f H ) < 2Vdet^(z)||^|Uco||V/|| L i ( B HW ) 
where dil^/ff) := lim^o ||V/ff||ioo(_B(a,e)) is the local dilatation of f at z. 



Proof: We first establish 1. The continuity of fn follows from the expression (6.41) of 
the distorted mollification, and from general results on parameter dependent integrals. 
Inequality (6.44) follows from the fact that ip is supported on [0, 1] and from (6.42). 

We now turn to the proof of 2. Metrics LL G H a satisfy \\LL(z)~2 — LL(z')~2 1| < \z — z'\ 
for any z,z' G M. d , see Definition (5.1.12) (in the previous chapter). Hence for any fixed 
u G B(0, 1) the map 

z i— )■ z + H(z)~2u 



is 2-Lipschitz. Derivating (6.41) under the integral sign we therefore obtain 



dUfn) < [ 2\Vf(z + H(z)- 1 *u)\\<p(\u\)\du 

JTR d 

[ \Vf{z + H(z)-3u)\d 

JBrr(z) 



< 2 1 1 ^? 1 1 _t,oo 



'B H (z) 

= 2y/detH(z)\\<p\\ L -\\Vf\\ LHBH(z)) 
which concludes the proof of this proposition. 







6.4.1 The L p error 



This subsection is devoted to the proof of the following proposition, which states that 
the finite element approximation error measured in the LP norm of a function / on a mesh 
T, is controlled by the error e#(/) p associated to a metric H when the mesh satisfies the 
size condition (6.45). 

Proposition 6.4.2. There exists a constant C = C(m, d, ip) such that the following holds. 



Let LL G H pcr fl H and T G T Der be such that 



per 



4 2 LL(z) <n T , TeT, zeT. 



(6.45) 



Then for all 1 < p < oo and all f G L v vcx we have 



11/ — I™ /H'||Lp([o,i] d ) — Ce H (f) p , 



where we denoted LL' := 4 2 LL. 

Our first intermediate lemma studies the variations of the balls Bh{z, r) as z G M. d and 
r > vary and when LL G H s is a graded metric. We recall, see (5.53) and (5.54) (in the 
previous chapter), that for any LL G H ff , and for any z,u,v G 
one has 



satisfying \\u\\ H{z) < 1 



\u\\h(z))\\v\\h(z) 



< 



\\v\\h(z+u) < (i - IMIh^)) 

(1 - \\u\\ H ( z) ) d y/detH{z) < y/det H{z + u) < (1 - \\u\\ H {z) y d ^ det H(z] 



(6.46) 
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Lemma 6.4.3. Let H E H g and let z E R d . 

1. For any z' E B H (z, 1/2) one has B H (z', 1/4) C B H (z). 

2. For any z' E B H (z, 1/6) one has B H {z', 1/2) D B H (z, 1/4). 

Proof: We first prove Point 1., and for that purpose we observe that for any q E M. d 

\\q — z'\\h(z') > (1 - \\z' - z\\ H (z))\\<l - z '\\h(z) 

> (1 - \\z' - z\\ H ( z ))(\\q - z\\ H ( z ) - \\z' - z\\ H ( z )) 

> (l-l/2)(||9-*|| flW -l/2) 

If \\q — z\\h( z ) > 1 then the previous equation implies \\q — z'\\h( z ') > 1/4, which establishes 
the first inclusion. 

We now turn to the proof of Point 2. We obtain for any q E M. d 

\\q- z'\\ H {z>) < (1 - \\z' - z\\ H{z) Y l \\q - z'\\ H{z) 

> (1 - ||* - z'\\ H(z) )(\\q - z\\ H(z) + \\z' - z\\ H(z) ) 

> (l-l/6)-\\\ q - z \\ H{z) + l/6). 

li\\q- z\\ H{z) < 1/4 then \\q - z'\\ H{z/) < |(| + |) = 1/2. Hence B H (z', 1/2) D B H {z, 1/4) 
as announced which concludes the proof of this lemma. o 

The next lemma estimates the finite element approximation error locally. 

Lemma 6.4.4. There exists C = C(m,d,(p) such that the following holds. Let H E H g , 
T E T, z E R d and let 

T Z :={TET;TC B h (z, 1/2)}. (6.47) 

Then 

HZ-irViHUnn.) < Ce H (f; z) p , (6.48) 
where H' := 4 2 H and fl z denotes the union of all triangles in %. 

Proof: We denote by C\ the norm of the interpolation operator 

I-- 1 : C°(T cq ) C°(T cq ), 

where T eq denotes the reference equilateral simplex. We obtain using a change of variables 
that for any triangle T and any g E C°(T) one has 

|| 1™ 1 g\\L°°(T) < CiHglU^T)- 

Let \i E P m _i be a polynomial satisfying ||/ - h\\lp(b h (z)) = e H {f] z) p . Let g := f - u. 
and let T E%. We have 

II It 1 9h'\\lp(t) < 1 9h'\\l°°(t) 

i 

< Ci\T\p \\g H i ||l°°(t) 

i 

< ClK \ v \\gH'\\L°°{B H (z,l/2))- 
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It follows from (6.44) that 



\9h>\\l°°(b b (z,i/2)) < \\<p\\l°° max{||p|| £ ,i(B H/ ( z '))v / det^'(2r / ) ; z' G B H (z, 1/2)} 
< \\<p\\l°° \\g\\mB H (z))4 d (l - l/2)"Vdet H(z), 



where we used the inclusion stated in Point 1. of Lemma 6.4.3 and (6.46) in the second 
line. Defining C* := 8 d Cj H^H^oo we thus obtain 



Tin-l _ II 

V 9H'\\LP(n z ) 



9h'\\ L p(t) 



\TeT z 



< C*\\9\\L^B H (z))V det H ( Z ) Yl l T l 



< c*II^IIlx(bhW)-, Sh(z) 

< C*u\\g\\ LP (B H (z)), 



B H (z)\v 



where we used in the third line the expression (6.43) of the volume \Bjj(z)\, and the 
inclusion (6.47). Since the distorted convolution reproduces polynomials in P m _i , as well 



as the interpolation operator I™ \ we have 

g - Vfr 1 9h = (J-I*)- lr'\f -l*)H = f- IT 1 fn- 



Therefore 



11/ -IT h 



\LP{il z ) 



| Ttn-1 „ || 

\9 - h~ 9H\\LP(n z ) 



< \\g\\Lp(n s ) + \\It~ 9h\\lp{q z ) 

< (1 + C*U}) \\g\\LP(B H (z)) 

= (1 + C*u) e H (f; z) p 



which concludes the proof. 



o 



We now prove Proposition 6.4.2 For each triangle T G T we define 



VL T := {z eR d ; T C B H (z, 1/2)} and n(T) := / y/det H(z)dz 



and we observe that 



z G fir if and only if T G %, 



where T z is defined in (6.47). Note also that Qt C B(zt, 1/2) since #(z) > Id for all 
z G R d , hence the sets it + f2y, w G 2Z d , are pairwise disjoint. We denote by % a system 
of representatives of the set T for the relation of equivalence 



T ~ T' if T = T' + u for some u G 
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It follows from the local error estimate (6.48) that 
C p e H (fY p 



> 



\ p dr 



C p [ Vdet H(z) e H (f; z)ldz 
[ y/det H(z) \\f-ir l fn\ 

J[0,l] d 

/ ^XH{z) ^ ||/ - I?" 1 f H \\ P LP{T) dz 

y; ii / - ir 1 /Am / v^W) t x ^ - U )dz 

Y, \\f -IT 1 M\i KT) [ Vdet H(z) X n T (z)dz 

Eii/- i r 1 ^ir LP(T ^T 



TeT, 



> 11/ -Ir /^ll^([o,i]' i )? 1 i^ riT - 
We now establish that ny is uniformly bounded below, which concludes the proof of 



Proposition 6.4.2 It follows from (6.45) that T C 5^(^,1/4), where zt denotes the 



barycenter of T. The inclusion property established in Point 2. of Lemma 6.4.3 thus 
implies that Bh(zt, 1/6) C Qt- We thus obtain the lower bound 



n(T) > \B H (z T , l/6)\ min ^det H (z) 

z&B H (z T ,i/e) 



> 



uj6 



-d 



V'det H(z T 
w(5/36) d , 



Vdettf(#r)(l-l/6) £ 



(6.46) on the variations of y/det H(z). 



where we used the explicit expression (6.43) of the volume of \B H (z, r)\ and the estimate 



6.4.2 The W liP error, when the measure of sliverness is uniformly 
bounded 

We establish in this subsection the following proposition which shows that the finite 
element approximation error, measured in the W l,p semi norm on a mesh with bounded 
measure of sliverness, is controlled by e a H {W f) p . 

Proposition 6.4.5. There exists a constant C = C(m, d, (p) such that the following holds. 
Let H G H pcr D H a and T G T per be such that 

A 2 H(z) <H T , TeT, z eT. (6.49) 

Then for all 1 < p < oo and all f G we have 

||V(/-irV*0lb([o,i] rf ) < Ce a H (f) p m^S(T), 
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where we denoted H' := 4 2 H. 

The main difficulty of this proof is contained in the following lemma, which states that 
a near-best approximation of the gradient of a function, on an ellipse, has the form of the 
gradient of a polynomial. In this proposition we denote by u ■ v the scalar product of two 
vectors u, v £ M. d . 

Lemma 6.4.6. 1. (The orthogonal projection is compatible with gradients) We equip 
the space L 2 (B,IR d ) ; where B := B(0, 1), with the scalar product 

(v, w) :— / v (z) • w{z)dz. 
Jv 

For any f £ H l (M) there exists \x £ P m -i such that the orthogonal projection of V/ 
onto P^ n _2 is V/i. 

2. (Optimization among gradients) There exists C pro j = C pro j(m, d) such that for any 
ellipsoid £ C M d ; any 1 < p < oo and any f £ W 1,P (S) one has 

inf || V/ - VfJ,\\ LP f S ) < C proj hif \\Vf-v\\ L p( S ) (6.50) 



m — 2 



Proof: We denote k :— m — 1, and we first establish Point 1. We define § := <9B and we 
denote by Dk the collection of polynomial vector fields v £ Pf_! satisfying 

div v{z) = for all z £ B 
v{z) ■ z = for all z £ § 

For any g £ if x (B) and any v £ Dk we thus have 

/ Vg ■ v = — / div z/ + / (7(2;) ^(z) • 2; (fe = 0. (6.51) 
Jb Jn Js 

Our first objective is to establish the orthogonal decomposition 



The spaces VP& and Dk are orthogonal according to (6.51 ), and we know that dim VPfc = 
dimPfc — 1. Therefore we only have to show that dini-D^ > dim(P^ _1 ) — dimP^ + 1. 
The vector space Dk is the kernel of the map dk defined as follows 

4:Pti ->• P fc - 2 ©P fc (S) 
v H> (divz/, (u(z) ■ z)\ S ), 

where Pfc(S) denotes the collection of restrictions to § of elements of P^. The kernel of 
the map 

7fc :P fe -> P fe (S) 
fi H> //|§ 

is Ker(7 fc ) = (1 - |^| 2 )IP fc _ 2 . Hence dim(P fc (S)) = dimP fc - dimP fe _ 2 . 
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The map is not surjective, since we have the linear relation 

div v{z) dz = I u(z)-zdz. 

Jv 

Therefore 

dim Dk = dim Ker dk 

= dim(Pf _1 ) -dimlm4 

> dim(P^ 1 ) - (dim P fe _ 2 + dim P fe (§) - 1) 

= dimtPf- 1 ) -dimP fc + l, 

which implies the orthogonal decomposition F k _ 1 = © VP& as announced. 

Finally for any / G i/ 1 (B) the vector field V/ is orthogonal to according to (6.51 ). 
The orthogonal projection of V/ onto P^_! is therefore also orthogonal to Dk, hence is 
an element of VP& which concludes the proof Point 1. 

We now turn to the proof of Point 2. We denote by Q : L 2 (B, R d ) Pjj^ the L 2 (B) 
orthogonal projection. We recall that for any v G L 2 (B,IR d ) the projection Q(v) G P^_i 
is defined by the collection of linear conditions 

> - Q(v)) ■ v = for aU v G P^. 

Hence Q(v) depends linearly on a finite number of moments of v, and therefore conti- 
nuously extends to all v G L 1 (B,IR d ). Therefore there exists a constant Cq such that for 
all 1 < p < rx and all f G 7 " ; R 



\\Q(v)\\ Lm <C \\v 



" tf( 



For any A G GL d and any u G L 2 (B,IR d ) we claim that Q(A(v)) = A(Q(v)). Indeed for 
any v G Pfc_i one has 

(A(v(z)) - A(Q(v(z)))) ■ is(z)dz = [ (v(z) - Q(v(z))) ■ A T (v(z))dz = 0. 

Consider 1 < p < oo, / G W^QB) and A G GL d . Let fi G P m _i be such that Q{Vf) = 
V/i, and let v G P m _x> we obtain 

\\A(Vf-Vfi)\\ LPm = \\A(Vf)-A(Q(Vf))\\ Lm 

= \\A(Vf)-Q(A(Vf))\\ LPm 

= \\A(Vf-v)-Q(A(Vf)-p)\\ Lm 

< (l + C )\\A(Vf-u)\\ Lm . 



Therefore 



inf ||A(V/-V/i)|| LP(B) <(l + C ) inf \\A(Vf - v)\\m 
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Using a linear change of variables E — > B we obtain the announced result (6.50), with the 
constant C pro j := C + 1. This concludes the proof of this lemma. o 



The rest of the proof of Proposition 6.4.5 is extremely similar to the proof of Propo- 
sition 16.4.21 The estimates 



\f H (z)\ < ^det H{z) IMIzo 
I™ -1 9\\l°°(t) < C/||V^||ioo( T ), 



L 1 {B H {z)) 



used in the proof of Proposition |6.4.2 have the counterparts 

d\\ z {f H ) < 2^detH{z)\\ip\\ L ^\\V f\\ L r {BH{z)) 
\\VVT 1 gh-m < C'jWVgh^SiT), 



which are proved respectively in Lemma 6.4.1 and Lemma 3.6.3 (in Chapter^). From this 



point the adaptation of the proof is straightforward and is therefore left to the reader. 



6.4.3 The W 1,p error, on a general mesh 



This subsection is devoted to the proof of Proposition 6.4.7, which estimates the W l,p 
approximation error of a function / on a mesh with arbitrary measure of sliverness, in 
terms of the approximation error e 9 H (Vf) p with respect to a metric. This approach does 
not allow to recover the optimal anisotropic error estimates for W 1,p norms developed in 
Chapter |3j However it is perhaps more taylored to current anisotropic mesh generation 
software, which generally do not guarantee any bound on the measure of sliverness (or 
the maximal angle in dimension 2, since S(T) = max{l, tan(6*/2)} for any triangle T of 
maximal angle 9). 

Proposition 6.4.7. For all Co > 1 there exists a constant C = C(Co,m,d,ip) such that 
the following holds. Let H e H per R H 9 and let T G T per be such that 

A 2 H(z) <H T < (AC ) 2 H(z), TeT, zeT. (6.52) 

Then for all 1 < p < oo and all f e W^f r we have 

\\^U fH>)\\L^m< c z 9 HU)r, 

where we denoted H' := 4 2 H. 

Our first step in the proof of this proposition is the next lemma which estimates the 
local approximation error. 

Lemma 6.4.8. For all Cq > 1 there exists C = C(Cq, m, d) such that the following holds. 
Let H G H por and T G T per be such that (6.52) holds. Let T G T and z G T. Then for all 
1 < p < oo and all f G we have 

l|V(/ -ir 1 f H >)\\ L v(T) < Ce 9 H (Vf; z) p , (6.53) 



where we denoted H' := 4 2 H. 
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Proof: We assume in a first time that z = and that H(z) = Id, in such way that 
Bh(z) = B := 5(0, 1) is the euclidean unit ball. We consider a polynomial \i G P m _i 
such that || V(/ — h)\\lp(b) is minimal, and such that J B (f — //) = 0. The latter point can 
be ensured by adding an appropriate constant to \i. Defining g :=/ — //, we obtain using 
Lemma 16.4.61 

Since g G and J B g = there exists according to Sobolev's injection theorem a 

constant C so b = C so t,(m, d) such that 

IMU^B) < Csob || 1 1 ii (B)- 

The function is continuous on T and satisfies according to Lemma |6.4.1| 

\\9h'\\loo{t) < \\(p\\oc^^{\\g\\L^B H ,(z'))\/ det H '( z ') ; z' eT} 

< |M|oo|M|Li(B)4 d (l-l/2) d 
= Ci||p||i,i(B), 



where we used (6.46) and the fact that ||z — z'Hi^z) < 1/2 for all z' G T. 

We denote by d the norm of the operator VI^T 1 : C°(T cq ) -»■ P^_ 2 , where T eq 
denotes the reference equilateral simplex : for any h G C°(T eq ) 

ll^ 1 ^ 1 ^IU°°(T cq ) < Cl||/i||L°°(T cq )- 

1 

We recall that T-Lj,(T) is the image T eq by a translation and a rotation, see Proposition 

_ 1 

in the previous chapter. Choosing the function h := g H r o"H r 2 G C°(T cq ), we obtain 



5.1.3 



after a change of variables 

_ 1 

5 /T-7 jm—1 "XII y— f II _ II 

>(T), 



|?V(vi™-V0IU~(t) < Cjii^iU" 



which implies 

_^ 1 

|| V fi , _H"'|U°°(T) < Ci||"Hj.|| 1 1 P^' II L°°(T)- 

I 1 

We thus obtain, since ||"Hy|| < Co\\H'(z)* \\ = ACq and since |T| < |B|, 

\\Wjr~ 1 9h' II lp(t) < |B|p||VIy _1 gH'\\L°°(T) 

1 1 n 

< 4CoCi|B|p||ph'IU°°(t) 

< 4C C 1 Ci|B|p 11^11x1(8) 

< 4CoC 1 C I C sob |B|?||V^|| L i (B ) 

< 4C C 1 C I C sob |B|||V(7|| LP (B). 

Defining C* := 4C CiCiC so b\M\ we obtain 

iiva-irVif)iiL P( T) = nv^-ir 1 ^)!!^) 

< (1 + C,)||V^|| L p(b) 
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where C = (1 + C*)C pro j. 

We now turn to the general case, and we do not any more assume that z = and 
H(z) = Id. We define the affine change of coordinates <3>(V) := H(z)~*z'+z, and we apply 
our previous reasonning to the metric H§ defined by (5.30) (in the previous chapter), 

H*(z') := H{z)-^H{^{z'))H{z)-^, 



which satisfies H$(0) = Id and if$ E H s according to Proposition 5.2.2 (in the previous 
chapter). We also define the triangle X$ := $ _1 (T), the function /* := / o $ and the 
metric H'^ = 4 2 H<s>. The first part of this proof implies that 



V(f - I?T fn'JhnT.) < Ce^iyr- 0)„. 



We thus obtain 



nva-irVifOik^T) < n^)ii !f ii^)-*(v(/-ir 1 /^))iii*(r) 

= \\H{z) || \ (det H{z))-k\\V{f - I-- 1 feJWvw 

< C\\H(z)\\HdetH(z))-^e a H ^f; 0) p 
= C\\H(z)\\h a H (H(zyHvf);z) p 

where we used the change of variables $ _1 : T — >■ T$ in the second and fourth lines. This 
concludes the proof of this lemma. o 



We now conclude the proof of Proposition 6.4.7 For any T E T and any z E T, 
recalling that H T < (4C ) 2 #( z) we obtain 

\T cq \/\T\ = v^et-Hr < (4C ) d y/ det H(z). 



Averaging (6.53) over T, it follows that 

llVa-ir 1 /)!!^) < p jf z) P p dz<C J T ^UI{z)e 9 H (f; zf p dz 

where O := (4C ) d C/\T eq \. 

We denote by % a system of representatives of the set T for the relation of equivalence 

T ~ T if T = T + u for some u e 

We thus obtain 

\\*{f-ir l fH')\\i 



lLf([0,l] d ) 



Ilp(T) 



TeT. 

< C'J2 f V<ietH(z)e 9 H (f; z%dz 
TeT* ^ T 

= C [ V 'det H(z)e 9 H (f; z)*dz 
J\o,i] d 



which concludes the proof of this proposition. 
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6.5 Asymptotic approximation and explicit metrics 

This section is devoted to the explicit construction of a metric H adapted to a given 
function /, in the setting where the function / is sufficiently smooth and the mass of 
metric is asymptotically large. 

Our main result is the following theorem, which is the counterpart for metrics of 
the results developed in Chapters 2 and 3. It involves the shape functions K, L a and L g 



which are defined in the introduction of this chapter, £6.1.5 We do not establish that these 
estimates are optimal, but this is suggested by the lower bounds for the finite element 
interpolation error developed Chapters 2 and 3 for admissible sequences of triangulations. 

Theorem 6.5.1. There exists a constant C = C(m,d) such that for each f G C™ r 
hmsup I inf e H (f) p \ < C\\K(d m f)\\ T , 

M->oo \ fleH s 
m(H)<M 

where + Furthermore 



hmsup A^T— inf e%(Vf) p \ < C\\L a {d m f)\\ T 

M->oo \ iiGH Q 
m(H)<M 



hmsup M-r inf e 9 H {Wf) p \ < C\\L g (d m f)\\ T 

M->oo \ aeH s 
m{H)<M 



where - := ^—^ + 



Our first lemma shows that the optimization problems on SL^ defining the shape 
functions K, L a and L g can be reformulated into optimization problems posed on the 
collection S£ of symmetric positive definite matrices. 

Lemma 6.5.2. Let if) G C (GL^, Ml) be homogeneous of degree r > and such that 
if) (A) = if) (AO) for any A G GL^ and any O G Od- Then 

inf{(detM)^ ; M G Sj and ip(M^) < 1} = mf{rb(A) ; A G SL d }. 

Proof: To each M 6 S| we associate A := (det M)^M~^ G GL^, in such way that 
det(A) = 1 and if) (A) = (det M)^ifj(M~ s ) < (det M) Conversely to each A G GL d 
we associate M := ip(A)r (AA 1 )^ 1 G S'j", in such way that (detM)ad = if>(A) and 
ip(M~l) = ifj(ifj(A)-y-(AA T )^) = ip(A)~ 1 ip(AO) = 1 where O := A~ 1 (AA T )^ is orthogo- 
nal, o 

It follows from this lemma that for any 7r G IH m 

K(ir) = inf{(detM)S ; ||7roM"5|| < 1} (6.54) 

L a (7r) = inf { (det M) 1 ^ ; || (Vtt) o ikH|| < 1} (6.55) 

L 9 (tt) = inf { (det M) 11 ^ 1 ; ^/||Af||||V(7roAf-2)|| < 1}. (6.56) 
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Remark 6.5.3. The optimizations among symmetric matrices appearing in these expres- 
sions have a geometrical interpretation : finding the ellipsoid of maximal volume included 
in a set of algebraic boundary. Indeed one easily checks that the three following properties 
are equivalent for all tt G M m and all M G S£ 

i) \\ix o M~§|| < 1. (resp. ||(Vtt) o M~i|| < 1, resp. ^f^M\\\\V{'K o M~\)\\ < I) 

id 

■ \\*\\M J UI aU ^ Cff 

I m— 1 



ii) \n(z)\ < \\z\\m for all z G IR d . (resp. \Vn(z)\ < \\z\\™-\ resp. ||M||l|M-|(V7r(^))| < 



) 



Hi) The ellipsoid defined by the inequality \\z\\m < 1, z G R d , is included in the set of 
algebraic boundary defined by the inequality \ir(z)\ < 1. (resp. |V7r(z)| < 1, resp. 
||M||5|M-5(V7r(2))| < 1 w/w'c/i depends on M) 



We consider a fixed function / G C m and for each z G 
polynomial 7r^ G H m as follows 



|a|=m 



a! ' 



we define the homogeneous 



(6.57) 



where a\ — a%\ • ■ ■ aj for a 



(Qti, 



, a d ) G 



The heuristic guideline of the proof of Theorem 6.5.1 is the following (here in the case 



of approximation in the LP norm). Consider a C°° periodic map M : R d -»■ 5+ such that 
o M(2)~i|| < 1 for all z G lR d , and define 



H(z) 



(det M(z))^M(4 



Then is is not difficult to show that for all A sufficiently large one has XH G H a and 



m(A#)^(/) p <C||(detM) ; 



This leads to the estimate announced in Theorem 6.5.1 
rable to \\K(d m f)\\ LT ^ Q ^ d y in other words if (detM(z)) 
z G " ' 



lM[o,i] d ) • 

if || (det M)i 
u is comparable to K(n z ) for all 



lL-([0,l] d ) 



is compa- 



which means that M(z) is a close to be a minimizer of the infimum (6.54) defining 



K(tt z ). The same principles apply to the two other estimates stated in Theorem 6.5.1 



The main ingredient of the proof of Theorem 6.5.1 is the construction of a continuous 
map n I—)- A4(tt) such that \\tt o J\A(tt)~2 \\ < 1 and such that (det M.{Tr))^d is sufficiently 
close to K(tt) (in the case of the approximation in the LP norm). Up to a few technicali- 
ties, which are adressed in § 6.5.4 , defining M(z) := M(it z ) and reasoning as above then 



concludes the proof. We give in §6.5.1| an explicit expression of such a map in terms of the 
coefficients of tc, in the case of piecewise linear and piecewise quadratic approximation in 
two dimensions. Such explicit expressions are valuable for numerical implementation in 
finite element software which use anisotropic meshes, and have been implemented by the 
author in |93|. 



We introduce in { 6.5.2 the property of "geometric convexity" for functions in C°(S^ , . 



which is a variant of the classical notion of convexity. This is used in §6.5.3 to define some 
variants K^ a \ a G (0, oo) of the shape function K which are defined by well posed opti- 
mization problems on St, in arbitrary dimension d and degree m. In contrast for some 
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choices of it G H m there exists no minimizer to the optimization problem appearing in 
the expression (6.54) of K(n), and other choices there exists an infinity of minimizers. 
We conclude the proof of Theorem 6.5.1 in §6.5.4 Eventually §6.5.5 is devoted to the 
proof of an additional property of the shape functions K, L a and L g : they are uniformly 
equivalent on H m to some continuous functions. 



6.5.1 Explicit Metrics 

We give in this section some explicit minimizers, or near minimizers (in a sense defined 
below) of the optimization problems among symmetric matrices appearing in the expres- 
sions (6.54), (6.55) and (6.56) of the shape functions K, L a and L g . Our results are so far 
limited to the case m — 2, which corresponds to piecewise linear approximation, and to 
m = 3 in dimension d = 2, which corresponds to piecewise quadratic approximation. 

These minimizers or near-minimizers have already been presented in Chapter 2 for the 
shape function K, and Chapter 3 for the shape function L a . The latter are recalled in the 
next proposition, and completed with their counterparts for the shape function L g . 

For any homogeneous quadratic polynomial tt we denote by [n] G Sd the associated 
symmetric matrix, which satisfies z t [7t]z = tt(z) for all z eR d . Note that II 7r|| = || [k] II . 

Proposition 6.5.4. i) (Piecewise linear approximation, in any dimension) If m = 2 
and d > 2, then for any it G IH 2 , one has 

L a (ir) = 2| detvr|5 and L g (ir) = 2a/W y/\ detn\. 



The minimizing matrices M in the expressions (6.55) and (6.56) of L a {ii) and L g (n) 
are respectively 



M a (7r) = 4[tt] 



and Ai g (7i) 



7T 7T 



ii) (Piecewise quadratic approximation, in two dimensions) If m = 3 and d = 2, let 



71 



ax" 



3bx 2 y + 3cxy 2 + dy s G B z , let 



MJtt) 




b c 
c d 



and let 

M 9 (n) := M a (ir) + ( Z pp) ' M 
where disc it is the discriminant of the cubic polynomial n : 

disc7r = 4(ac — b 2 ){bd — c 2 ) — {ad — be) 2 . 



(6.58) 



Then M. a (n) and M. g {Ti) are near minimizers of optimization problems defining L a {j[) 
and L g (n) respectively, in the following sense : there exists a constant C such that 
for all re G JH*3 



i/det M a (n) < CL a (n) and 



det M g (n) < L g {ir). 
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Furthermore if it is not univariate then Ai a {^) and A4 g (ir) are non degenerate and 
we have 



II(Vtt) oM a (Tr)—2\\ < C and ||M fl (7r)||2||V(7ro M t 



7T 



< c. 



(6.59) 



Proof: We recall that V(ir o A) = A t ((Vtt) o A) for any tt G H m and A G GL d . 

We begin with the case i) of a quadratic polynomial tt G H2, and we observe that 
Vn(z) = 2[ir)z. Hence |V7r(z)| 2 = Az t [k] 2 z and the announced result for L a easily follows. 
For L g , we are looking for a matrix M G such that 



|Af2||||Vf7r o M"5' 



2||M5|| ||M~a[7r]Af~2|| < 1, 



(6.60) 



and of minimal determinant. Recalling that ||AB|| > \\A 
therefore ||M~5 [7r]M~2 1| > 



TT 



B 



-in — 1 



for all A,B e GL d , 



M\\, we obtain from (6.60|) that 2||7r|| < 1 1 AiT 2 1| - Hence 
4||7r|||| M~3[7r]M~3|| < 1. (6.61) 

A matrix of minimal determinant which achieves this inequality is clearly M = 4||7r|| \[tt]\, 
where \S\ denotes the absolute value of a symmetric matrix S G Sd- Observing that this 



matrix also satisfies the condition (6.60) we conclude the proof of Point i 



We now turn to the case ii) of cubic polynomial in H 3 . The case of L a and M. a is 
exposed in Chapter [3j We therefore turn to the case of L g . Combining the homogeneity 
and the continuity of the functions involved, one easily shows that there exists a constant 
C such that for all n G H3 



< 



IV 



TT 



< CnlW 



C^hW < A(tt) := \\M a (n)\\ < C Q \\ir\\, 



and 



Ii(tt) := \\Ma{ir) 



-111 — 1 



- disc TT 

IMI 



< Coin 



(6.62) 



(6.63) 



We consider below a fixed polynomial tt G H3, and we recall that 

L 9 (tt) := M{VdetM ; y/\\M\\ || V(tt o M~^)\\ < 1}. 



We first prove the lower bound L g (ir) > c^/det M. g (n), where c > is an abso- 
lute constant. For that purpose we consider a m atrix M G S j~ satisfying the constraint 
HM^im V(ttoM~2)|| < 1. Combining this with (6.62) we obtain 



1 > Cn _1 llM5|||koM~5|i > (7-1 p - 



\M*\ 



7T 



TT 



hence ||7r|| < Co||M||. It follows that 

1 > Co" 1 ^ 1 !^!!)^ noM'*] 



|Mi|| 3 C ||M|| 



Iff O M"2 
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_2 l 

where n := C 2 ||7r|| 2 vr. The solution M of the minimization problem 

inf{detM ; ||7roM~i|| < 1} 



is known exactly in the case of cubic polynomials in two dimensions, see Proposition 2.4.4 
in Chapter [2j and satisfies 

_ 3 , 

(detM) 3 = k\ disc 7r| = k(C 2 ||7t|| 5 ) 4 | disc 7r | = k,Cq 6 \\ ir\\ 2 \ disc 7r|. 

where k is a positive constant which depends only on the sign of disc7f. Defining cq : = 
RsCq 1 , where k stands for the minimial value of k for the two possible signs, we obtain 

L g (n) > VdetM 

= k^Cq' 1 ||7r|| 3 1 disc7r|6 

On the other hand, there exists c\ > such that 

L g (n) > L a (n) > c x ^det M a (7i) = dy/ fi(n) . 
Furthermore we have 

det.M g (7r) = (A(tt) + | |tt 1 1) (a* (vr) + u(ir)) 

< (l + C )min{A(7r), ||tt||} (2 max{ / u(vr), i/(tt)}) 

< 2(1 + C ) max{A(7r)/i(7r), ||7r||i/(7r)}. 

We thus obtain 



Met M g (n) < C*L g {ii) where C* := 2(1 + C ) max{c 1 , c x 1 }. 
which concludes the proof of our lower estimate for L g . 



We now turn to the proof of the property (6.59), which is equivalent to the following 
\\M g ^)\\hM 9 {^{V^{x,y))\\ < C when \\(x,y)\\ Mg{7T) < 1. 



For any rotation U G 2 we have M. a {n o U) — U \M a (ir)U, see Proposition 3.4.3 



in 



Chapter [3j and disc(7r o U) — (disc7r)(det U) e = disc7r, see Chapter |2j We may therefore 
assume, up to a rotation, that -M a (7r) is a diagonal matrix and that the first diagonal 
coefficient is larger than the second one. Therefore 

A(tt) 2 = a 2 + 2b 2 + c 2 and u.(n) 2 = b 2 + 2c 2 + d 2 , 
and At a (ir) = ( ^ ? \ J ■ Note that this matrix is degenerate if and only if u.(ir) = 0, 



Mtt) 

which means that 7r is the univariate polynomial ax 3 . In order to avoid notational clutter, 
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and since the polynomial it e H3 is fixed, we denote below by A, fi and v the quantities 
A(7r), jiiji) and u(tt). Our purpose is to establish an upper bound on the quantity 



\M g (n)p\\M g (n)-HVn(x, 



{ax 2 + 2bxy + cy 2 ) 2 + 



X + u 



(bx 2 + 2cxy + dy 



2\2 



(6.64) 



under the hypothesis 



which implies in particular Ax 2 < 1 and max{/i, i/}?/ 2 < 1. Injecting this in (6.64), and 
observing using (6.62) and (6.63) that v < CqA, we see that it is sufficient to bound the 
following quantities 



a 2 c 2 
A 2 ' A/i' /1 2 



and 



A b 2 Ac 2 



A 



d 2 



/iA 2 ' /i/iA' max{/i, i/} max{/z, z/} 2 



Observing that 



H > max{6, c, d} and A > max{a, b, c, /x} > max{a, b, c, 0?} 



(6.65) 



(6.66) 



we find that all the quantities appearing in (6.65) are smaller than one, except perhaps 
the last one : 

Aci 2 

7 1¥ 6 - 67 

max{/i, v) 6 



Using the expression (6.58) of the discriminant we find that 

- disc 7T = (a 2 + b 2 + c 2 )d 2 + aQ(b, c, d) + R(b, c, d) 

where Q and R are homogenous polynomials of degree 3 and 4 respectively. Hence there 
exists a constant C\, independent of ir, such that 

X 2 d 2 = — disc ir — (aQ(b, c, d) + R(b, c, d)) 

< ||7r||^ 3 + CxA/i 3 

< (C + Ci) A max{/i, z/} 3 . 



The previous inequality yields a uniform bound on ( |6.67 ), which concludes the proof of 
this proposition. 



o 



Proposition (6.5.4) immediately gives an explicit expression, up to a fixed multipli- 
cative constant, of the shape functions L a and L g in the case m = 3 and d = 2 : 
L a {ft) — \fM a {ir) and L g (-n) ~ a/A^ 9 (7t). It follows that 



L g (ax 3 + dy 3 ) ~ vjodj max{|a|, |d|} and L a (ax 3 + dy 3 ) ~ \/|ad|, 



and 



L g (ax 3 + 3cxy 2 ) ~ L a (ax 3 + 3cxy 2 



max{|a|, |c| 
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where f(a, b, c, d) ~ g(a, b, c, d) means that there exists a constant C > 1 such that 
C~ 1 f(a, b, c, d) < g(a, b, c, d) < Cf(a, b, c, d) for all a, b,c,d G 1R. 

For a function which has anisotropic features of the type ax 3 + dy 3 , where a and d have 
different orders of magnitude, the use of quasi-acute meshes and metrics may therefore 
lead to a substantial improvement of the approximation error compared to the use of 
graded meshes and metrics. This is not the case in contrast if all the anisotropic features 
of the approximated function are of the type ax 3 + 3cxy 2 . 



6.5.2 Geometric convexity 

We introduce in this section the geometric average of two symmetric positive definite 
matrices, and the property of geometric convexity for functions defined on S~J. This notion 
is a variant of the classical property of convexity, which is appropriate for the study of 



the minimization problems appearing in the expressions (6.54), (6.55) and (6.56) of the 
shape functions K, L a and L g . 

Let M,M' G Sj, and let S := M~2. We define the geometric average Avg(M, M') G 
Sj of M and M' as follows 



SAvg(M,M')S := VSM'S 
For instance Avg(M, M _1 ) = Id, and for any k, k' > 

Avg(/cM, k'M') = Vkk' Avg(M,M'). 



(6.68) 



The following proposition gives an alternative characterization (6.69) of Avg(M, M' 



This proposition also shows that Avg(M, M') is a natural midpoint between M and M' for 
the distance d x , and establishes two basic properties of the geometric average of matrices. 

Proposition 6.5.5. For any M,M' G the following holds. 

1. A matrix M G Sj[ satisfies M = Avg(M, M') if and only if there exists A G GL^ 
and a diagonal matrix D such that 



M = A T A, M = A T e 2D A, M' = A T e~ 2D A. 



(6.69) 



2. The geometric average Avg(M, M' ) is a midpoint between M and M' for the distance 
d x on : 

d x (M,Avg(M,M')) =d x (Avg(M,M'),M') = ^d x (M,M'). 

3. The geometric average is commutative and compatible with the inversion of ma- 
trices : 



Avg(M, M') = Avg(M',M) 
Avg(M,M') _1 = Avg(M" 1 ,M / - 1 ) 



(6.70) 
(6.71) 
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Proof: We first establish point 1. Let us assume in a first time that M, M' and M have 



the form (6.69). We easily obtain 



MM '.\7 = M' 



Defining S := M 2 we thus have 



(SMS) 2 = SM'S. 



Taking the square root of the previous equation we obtain (6.68) which shows that M = 
Avg(M, M'). 

In order to establish the converse of this property we only need to show that for any 
M, M' G Sj there exists A G GL^ and a diagonal matrix D such that M = A T e 2D A 
and M' = A T e~ 2D A. The expression of Avg(M, M') then automatically follows from the 
previous argument. Since the matrix M'~zMM'~2 is symmetric positive definite, there 
exists U G Od and a diagonal matrix D such that M'~iMM'~~2 = £/ T exp(4D)[7. Choo- 
sing A := e D UM'2 we obtain the announced result. 



We now turn to the proof of Point 2. Let A G GL^ and let D,D' be diagonal matrices, 
of diagonal coefficients D\, ■ ■ ■ ,Dk and D[, ■ • ■ ,D' d respectively. We define 

5 := max \D; — D'A. 

l<i<d 

We claim that the matrices N := A T e 2D A and N' := A T e 2D 'A satisfy d x {N,N') = 5. 
Indeed 



d x {N, N') 



sup I In ||w||jv 



sup 
sup 



In ||w||jv' 

L> i ,,| 1„ l„0' 



\n\e u Au\ -\n\e D Au\ 
In \ e D v I — In I e D v 



For any v = (v 1, • • • , Vj) G M. d we have 



\e D v\ 2 



e 2D *v 2 < e 2 * J2 ^ = ( 



e s \e D 'v\ 



Ki<d 



Ki<d 



and similarly \e D 'v\ < e 5 \e D v\. Therefore | In \e D v\ — In \e D 'v || < 5, and taking the supre- 
mum among all v ^ we obtain d x (N, N') < 5. Furthermore denote by i , 1 < i < d, 
the position such that \D iQ — D' io \ = 5. Choosing v — (0, • • • , 1, • • • , 0), with the nonzero 
coordinate at the position i , we obtain 5 = \\n\e D v\ —\n\e D 'v\\ < d x (N,N'). We thus 
conclude that 



d x (N, N') = d x (A T e 2D A, A v e 2U 'A) = S = \\D - D'\\. 



T„2D' 



Recalling that M, M' and M := Avg(M, M') have the form (6.69) we thus obtain 
d x (M,Avg(M,M')) = rf x (Avg(M,M'),M / ) = ||D|| and d x (M,M') = 2\\D\\. 
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which concludes the proof of this point. 



Finally we establish Point 3. Using the first point we obtain that M,M' and M :- 
Avg(M, M') have the form 



M = A A, M = A T e 2D A, M' = A 



\T-2D 



A 



It follows that 
and 



M = A A, M' = A e A, M = A e A, 



M- 1 = {A- l ) T A-\ M- 1 = (A~ 1 ) T e~ 2D A' 1 , 



M 



-1\T„2D A-l 



e lu A~ 



(6.72) 



(6.73) 



Equation (6.70) (resp. (6.71)) is the consequence of (6.72) (resp. (6.73)) and of the cha- 
racterization of the geometric average given in the first point. o 

We recall that a function ip G C (Sd, K) is convex if and only if for all M, M' e Sd one 



has 



M + M' 



< 



(6.74) 



The next proposition introduces the family of geometrically convex functions on which 
is defined by a property similar to (6.74) but in which the arithmetic averages are replaced 
with geometric averages. 

Definition 6.5.6. We say that a function ip G C°(S'J[,1RL+) is Geometrically Convex and 
Homogenous (GCH), if it satisfies the following properties. 

i) (Sub multiplicativity) For any M,M' G one has 



<p(Avg(M, M')) < y/<p(M)<p(M' 



(6.75) 



ii) (Homogenous) There exists aGl, called the degree of ip, such that for all M G S j" 
and all A G one has 

if(XM) = X a ip(M). 

We say that the function ip is Strictly Geometrically Convex and Homogeneous (SGCH), 
if if is GCH and if the equality in (6.75) only holds if M and M' are proportional. 

For instance the function det is GCH of degree d, since 

det Avg(M, M') = Vdet M det M' and det(AM) = X d M. 

We insist on point that GCH functions are by assumption continuous and strictly posi- 
tive on Sj- The next lemma enumerates some simple properties of geometrically convex 
functions. 

Lemma 6.5.7. 1. The product of two GCH functions is GCH, and the product of a 
GCH function with a SGCH function is SGCH. Furthermore the degrees add. 
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2. The elevation to a positive power a G IR+ of a GCH (resp. SGCH) function is also 
GCH (resp. SGCH). Furthermore the degree is multiplied by a. 

3. If a function if is GCH (resp. SGCH) then M ^ ^(M -1 ) is also GCH (resp. 
SGCH), and has the opposite degree. 

Proof: The first two points are immediate, and the last point is a direct consequence of 



the compatibility (6.71) of the geometric average with the inversion : if ip is GCH and 



M, M' G Sj then 

V 9(Avg(M,M , )~ 1 ) = y?(Avg(M- 1 ,M / " 1 )) < y/ ip{M~ l )ip{M'- 1 ) , 
which concludes the proof. 

Aside from the determinant, a number of functions on are geometrically convex. 
The next proposition, which is based on an argument of complex analysis, enumerates 
some of them. For all 7r G H m we define 

||7r||c := sup{|7r(^)| ; z G (D d , \z\ < 1}. 

Proposition 6.5.8. 1. The trace map M H> TrM is SGCH. 

2. For any fixed polynomial tt G IH m \ {0} the map ||ttoM2|| c is GCH, of degree 
m/2. 

3. The norm map M i-)- ||M|| is GCH. 

Proof: We first establish Point 1. Let M, M' G Sj and let M := Avg(M). Let^A G GL d 
and let D be a diagonal matrix such that M = A T e 2D A, M' = A T e' 2D A and M = A T A. 
We denote by (Aj)i<ij<d the coefficients of A, and we define for all 1 < i < d 

A := E 4- 

l<j<d 

Note that Ai > for all 1 < i < d, since otherwise a full line of A would be zero and A 
would be degenerate. Denoting by ■ ■ ■ ,D d the diagonal coefficients of D we obtain 

Tr(M) = A 2 e 2A , Tr(M') = ^ A 2 e" 2A , Tr(M) = ^ A\. 

l<i<d \<i<d \<i<d 

It thus follows from the Cauchy Schwartz inequality that Tr(M) 2 < Tr(M) Tr(M'). Fur- 
thermore equality occurs if and only if the vectors 

(Aie Di )i<i< d and (Aie' Dl )i<i< d 

are proportional, which implies that Di = Dj for all 1 < i,j < d and therefore that M 
and M' are proportional. 



We now turn to the proof of Point 2., and for that purpose we recall a classical result of 
complex analysis, called Hadamard's three lines theorem. Let Q := {a G (D ; |9?(«)| < 1}, 
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where 9ft denotes the real part, and let g be a continuous and bounded function on Q 
which is holomorphic on Q. Then 



\g(0)\ 2 < sup \g(a)\) sup \g(a)\ . (6.76) 

\SRa=-l / \SRz=q / 

We consider a fixed z G (D d satisfying |z| < 1, and we define for all a G Q 

g(a) := ir(A T e aD z). 

The function g is continuous, uniformly bounded on by ||7r||c (|| A||e"' D ") d , and holo- 
morphic on (D, hence on Q. Therefore Hadamard's three lines theorem applies. For any 
a G { — 1, 1} one has 

sup \n(A T e aD z)\ < ||vr o (A T e aD ) || c . 



SR(a)=o- 

since je 19 ^ 1 ^! = |z| = 1, where S denotes the imaginary part and i the imaginary unit. 



Applying (6.76) to the function g we thus obtain 

\tt(A t z)\ 2 < ||tt o (A T e D )\\ € ||vr o (A T e~ D )\\ € . 

Taking the supremum among all z G (D d such that |z| < 1 we conclude that 

||7roA T || c < ||7ro(A T e D )|| c ||7ro(A T e- D )|| c . 

One easily checks that the matrices 0,0', O G GL^ defined by 

OM? = e D A, O'M'^ = e~ D A and OM^ = A, 

are orthogonal. Since orthogonal matrices are also unitary matrices, in the sense that 
\Oz\ = \z\ for all z G (D, we obtain 



\noA T e D ||c = ||7roM2|| c , 

1 7T o A T e~ D \\ c = \\noM'3\\c 

rp ~ 1 .. 

broA L = 7r o M2 c , 



thus 1 1 -7T o Ma ||c satisfies the sub-multiplicativity property (6.75). Furthermore the func- 
tion M G I—?- 1 1 vr o Ma ||c is clearly continuous, strictly positive, and homogeneous of 
degree m/2 which concludes the proof of this point. 

We now establish Point 3, and for that purpose we denote by n G H2 the canonical 
quadratic form 

ir(z) := 4 
i<j<d 

Our purpose is to establish that for any M G S£ 

UttoM^Ic = ||M||, (6.77) 
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which implies that ||M|| is GCH in view of the previous point. For any z G (D such that 
\z\ < 1 one has 



\ir(M^z)\ = \z l M^M^z\ < \M^z\ 2 < \\M 



r 5||2 



IMII. 



Furthermore choosing a normalized eigenvector z G M d a ssocia ted to the maximal eigen- 
value of M we obtain ^(M^)] = ||M||, which establishes (6.77) and concludes the proof, o 



For each M E St we define 



k(M) := ^Tr(M)Tr(M 



The function k is SGCH according to Lemma 6.5.7 and since M i— > Tr M is SGCH. In 
terms of the eigenvalues Ai, • • • , A^ of M we have 



d 2 K{Mf 



a<i<d 



vl<i<d 



therefore k(M) > 1, using the Cauchy Schwartz inequality, with equality if and only if 
M = m Id for some m > 0. Note also that 

«(M) < Vll^llll^ -1 !! < dn(M). 



We regard the function k as a close variation on the condition number -i/fjMj] ||M _1 || of 
the matrix M*. 



6.5.3 A well posed variant of the shape function 

We define in this section three variants of the shape functions K, L a and L g : for all 
7T G H m 

K € (n) := inf ||7roA|| c 

AeSL d 

LJ(tt) := inf vWW^Sd 
L*» := inf p-lllTToAHc, 

where G(vr) := |V7r| 2 G Hl2 m -2- For each a G (0, oo) we define three new variants of the 
shape functions 

K {a) (n) : = inf{K(M)-(detM)S ; ||vroM"5|| c < l an d K (M) < e a }, 
Li Q) (vr) := inf{K(M)-(detM) !I ^ i ; \\G(n) o M~^\\ c < 1 and re(Af) < e"}, 
4 q) (tt) := inf{/t(M)^(detM) !: ^ ; ||M||5||ttoM-5|| c < 1 and k(M) < e Q }. 

This section is devoted to the proof of the following theorem. 
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Theorem 6.5.9. 1. The shape function K (resp. L a , resp. L g ) is uniformly equivalent 
to K € on B m (resp. L® , resp. ). 

2. For each n G M m we have the decreasing convergence 

lim K (a) {7c) = K € (tt). 

a— >oo 

(resp. Lo"\tv) — > L®(ir) decreasingly, resp. L g a \iT) — > (ir) decreasingly, as a — >• 
oo). 

3. For each a G (0, oo) and each it G M m \ {0} there exists a unique minimizer 
JA( a '{ir) G to the optimization problem defining K( a >(ir), and the map 

neH m \{0}^M {a \n)eS+ 
is continuous, (resp. likewise for L^ (it) and M^iit) , resp. likewise for L g a \n) and 

We first establish Point 1. of this theorem. Since the vector spaces H m and H2 m _2 
have finite dimension, there exists three constants Ck, C a and C g such that 

IKII < IMk < c^ll 71 "^ 7r g H m , 

Ik'H < IItt'Hc < C a \\ir'\\, 7rGH 2m _ 2 , (6.78) 
CjIVttII < ||7t|| c < CJVtt||, tt g H m . 



Hence it follows from the definition of the original shape functions, given in £ 6.1.5 that 
K < K € < C K K, L a < < C a L a and C; l L g < < C g L g on H m . 

Point 2. of this theorem is an immediate consequence of the fact that k(M) G [1, oo) 
for all M eSj. 

The following proposition illustrates the use of k as a regularization term, and imme- 
diately implies Point 3. of Theorem 6.5.9| 

Proposition 6.5.10. Let tp be a GSH function of degree r > 0. Let (X,dx) be a metric 
space, and let ip G C°(Sj x X, M.* + ) be such that ip(-, x) is GCH of degree —1 for any fixed 
x G X . We define for all x G X and all a G (0, oo) 

L {a \x) := inf {k(M)«¥>(M) ; ip(M,x) < 1 and k(M) < e c 



For any x G X there exists a unique minimizer M> a '{x) to the optimization problem 
defining L^ a \x). Furthermore the map Ai^ : X — > is continuous. 

Proof: We consider a fixed parameter a G (0, oo). We first establish the existence of 
a minimizer to the optimization problem defining L^^x), and we thus consider a fixed 
x G X. For any > the following collection of matrices is clearly a compact subset of 

si 

K p :— {M G S% ; | ln(detM)| < (3 and k(M) < e a }. (6.79) 



We define 



A := sup (p(M) and A := inf tp(M, x), 

MeKq MeK 
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in such way that for any M G S~£ satisfying k(M) < e a 

<p(M) < A(detM)^ and ip(M, x) > A(detM) _ i (6.80) 

Let (M„) n > , M n G Sj, be a minimizing sequence for the optimization problem defining 
L^ a \x). It follows from (6.80) that det(M n ) uniformly bounded above and below : hence 
there exists (5 > such that |lndetM n | < /3 for all n > 0. Furthermore «(M„) < e Q 
for each n > 0, hence M n G K^. Since the set is compact there exists a converging 
subsequence M ff ( n ) — > G Kg. By continuity of the functions ip, ip(-,x) and k on S%, 
the matrix M m is a minimizer of the optimization problem L^ a \x). 

We now consider two minimizers M, M' to the optimization problem defining L^ a '(x), 
and we intend to show that M = M'. We denote M := Avg(M, M'). Since the functions 
ip(-,x) and k are GCH, the matrix M satisfies the inequalities 

ijj(M,x) < 1 and k(M) < e a . 

We define for each M G S j~ the quantity 

$(M) :=«(M)«p(Af), (6.81) 

and we observe that $ : 5^ — > M.* + is a SGCH function of degree r > according to 
Lemma 6.5.7 If M is not proportionnal to M' we thus have 

$(M) < v/$(M)$(M') = L (a) (a;), 

which is a contradicts the fact that M and M' are minimizers of the optimization problem 
defining L^ a '(x). Thus M = mM' for some m > 0, but this implies 

Z>)(x) = $(M) = $(M') = m r $(M), 

therefore m = 1 and M = M', which establishes as announced that the minimizer is 
unique. 

Finally we establish that the map x H- M> a '{x) is continuous. For all x,x' G X, we 
define 

u(x,x') := sup |hT0(M, x) — ln^)(M, x')\, 

MeK 



where the compact set K is defined by (6.79), and we observe that u(x,x') — > as 
x' —7- a;. (Note that u(x, x') is the modulus of continuity of the continuous function X — > 
C°(K ,R), defined by x H> \nip(-,x).) 

Let x,x' G X and let M := e^^A^rc) and M' := M {a) (x') . We thus have 

k(M) = K(M (a \x)) < e a , 

and likewise k(M') < e a . Since ?/>(•, x') is homogeneous of degree —1 we obtain 

ip{M,x') < e w(a; ' x "V(M,x) = e^'^ie^'^M^ix)^) < 1, 
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and likewise ip(M',x) < 1. Furthermore since the function <3> defined by (6.81) is homo- 
geneous of degree r we obtain 

L (a \x') < $(M) = e r ^ x ' x,) L^{x), 

which implies 

$(M') = ^(e^ x,) M {a \x)) = e ruj{x ' xr) L {a \x') < e 2r ^ x ' ] L {a \x). 

Consider a converging sequence in I, i n -)> x, and define 

M n := e u{x ' Xn) M {a) (x n ). 

The arguments above show that M n is a minimizing sequence for the optimization problem 
L^ a \x). The arguments developed for the existence of a minimizer show that {M n ; n > 0} 
is contained in the compact set Kp if the constant > is sufficiently large, and that 
any converging subsequence of (M n )„>o tends to a minimizer of the optimization problem 
L^ a '(x), thus to MS a >{x) by uniqueness. 

Since the sequence M n takes its values in the compact set Kp and has the single 
adherence value Ai^ a '(x), we have the convergence M n — > Ai^ a \x) as n — > oo. Since 
uj(x,x n ) — )■ we obtain as announced 

M {a \x n ) ^M (a) {x), 

which concludes the proof of this proposition. o 



Remark 6.5.11. One easily checks that for each tt £ JH m , each a £ (0, oo) and each 
X £ E* we have K^(Xtt) = XL^(n) and 

M {a) (Xn) = X™M {a) {n). 

It follows that Ai^ a \iT) — > as n — > 0, and therefore that the map from M m to Sd defined 
by 

71 ^\ i fn = 

is continuous. Likewise L a and L^ are 1-homogeneous on H m , and Aii and M. 



(a) 



are 



m 

on M,, 



j homogeneous on M m which implies that they tend to as n — > 0, and are continuous 



6.5.4 Construction of the metric 

This subsection is devoted to the construction of a metric H which is adapted to a 
given smooth function / in the sense that the error en(f) P (resp. e^(V/%, resp. e 9 H (Vf) p ) 
is optimally small among all metrics of the same mass m(H) up to a fixed multiplicative 
constant. Our construction only applies to an interpolation large mass of the metric H. We 
do not establish that it is optimal, but this is suggested by the lower error estimates given 
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in Chapters 2 and 3 for the finite element interpolation error on admissible sequences of 
meshes. Producing a such a metric of a given mass independent of / is so far an open 
question. 

Our first lemma shows that the regularity constraints defining by the classes H a and 
Hg of metrics "disappear" when one multiplies a metric with a large constant. 

Lemma 6.5.12. For any H G H per which is C 1 , there exists Xq = Xq(H) such that 
X 2 H G H a for all X > X . 

Proof: The following maps are C 1 since they are the composition of C 1 maps 



z e 



\-> H(z) 



eS d 



z G R d h-> <p x (H(z)) := (u i->- In \\u\ 



H(z). 



G C°(S, 



where § := {u G Mr ; \u\ = 1} denotes the unit euclidean sphere (see £5.3 in the previous 
Chapter for a more in depth discussion on the map (p x ). Since H is C 1 and 2Z d -periodic 
these maps are uniformly Lipschitz, hence there exists two constants C+, C x such that for 
all z, z' G R d 



d+{H{z),H{z')) := \\H{z)-2 - H{z')-^\\ 

d x {H{z),H{z')) := \\ip x {H{z)) - ip x (H{z'))\\ L oo {s) 



< C+\z-z'\ 

< C x \z-z'\ 



Furthermore, since H is continuous and periodic, there exists e > such that H(z) > e 2 Id 
for all z G W 1 . Hence 

dn(z, z') > e\z — z'\, 



for all z,z' G M d . From this point, it follows from Remark 5.1.13 in the previous chapter 
that X 2 H G H a for all A > A := max{C + , C x /e}. 



Given a function / G C™ r , an exponent 1 < p < oo, and a triplet s = (a, p, 5) G 
of parameters we construct a C°° metric H as follows. For each z G M. d we define 

M{z) :=M (a) {7T z ) + pld, 



t>* \3 



where the homogeneous polynomial ti z G H m is defined by (6.57). Note that M is conti- 
nuous according to Remark 6.5.11 We consider a fixed mollifier ip, namely a radial non- 
negative compactly supported C°° function of integral one, and we denote ip$ := / 5)/S d . 
We then define 



H(z) := {det M'(z))™p+ d M'(z) where M' := ip s * M. 



(6.82) 



Given a symbol -k G {a, g} we define similarly M+ := A^i a \7r 2 ) + pld, Ml := ^ * M + 
and 

H*{z) := (det M^(^)) (™ s ^+ d M^(z) . 

The rest of this subsection is devoted to the proof of the following theorem, which 
immediately implies the asymptotic estimates (6.18), (6.19) and (6.20) announced in the 
introduction. 
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Theorem 6.5.13. There exists a constant C = C(m, d) such that the following holds for 
any f G C™, any 1 < p < oo and any e > 0. If the triplet of parameters s = (a,p,8) is 
well chosen then 

1. Defining r by - := ^ + - we have 



lim m(\ 2 H)^e x2H (f) p < C\\K(d m f)\\ L , m]d) + e. 



A— >oc 



(6.83) 



2. Defining r by - := ^^—^ + - we have 



lim m(X 2 H a )^e a x2Ha (Vf) p < C\\L a {d m f) \\ L , 



A— >oo 



([0,l] d ) 



d\ + e 



and 



lim m(A H, 

A— s>oo 



9) G X 2 H„ 



,{f)p<C\\L g (irf)\\ Lrmm +e. 
The proof of the three estimates are completely similar. We therefore focus on the 



proof of (6.83), and the details of the two other estimates are left to the reader. For 
each z G M. d we identify the polynomial ir z with the collection d m f(z)/m\ of m-th order 
derivatives of / at z, and we observe that 

\\det(M (a) (d m f)) 



L-([0,l] d ) 



a— >oo 
< 



\\K^(d m f)\\ LTm]d) 

\\K € (d m f)\\ L r m]) 
C K \\K(d m f)\\ L r m]) 



where the constant Ck is defined in (6.78). In the second line we used the fact that 



converges pointwise decreasingly to K (d m f), which implies the convergence 
of the integrals. We may therefore choose a sufficiently large and p sufficiently small in 
such way that 

||(detM)S|| iT([0jl]d) <C\\K(d m f)\\ L r m]) +e, 
where C = Ck/ttiI (recall that Ai^ and are homogeneous functions, as observed in 



Remark 6.5.11 ). 

For each z G M. d such that n z ^ we have 



\n z o M(z)-2\ 



sup 

\u\=l 



sup 



\ n z(u)\ 



u 



M{z) 



ir z (u)\ 



< sup 
M=i \\ u 



W z o M(n z 



< 1. 



We may therefore choose 5 > sufficiently small in such way that \\n z o M'(z) 2 1| < 1 for 
all z G [0, l] d and such that 



||(detM'(z))S||^ ([0)1]d) < C\\K{d m f)\\ LT{m) +e. 



(6.84) 
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Note that according to (6.82) 



m(H)r = \\(detM\z))^\\ LT n om . 



We define for each z e 



E 

|a|<m 



d\ a \f(z) Z° 
(dz) a a\ 



(6.85) 



and for each A > 

f z ,x{h):=\ m {f{z + h/\)-ii z {h/\)). 
Using the isotropic change of variables h n> z + hj A, we obtain 

e\*H(f; z) P = mf ||/ - v\\lp(b h ( z ,i/\)) = X~ {m+ p ] inf \\f zX - ^\\ lp{Bh{z)) . 
/ieP ra -i ^eP m _i 

Consider R > such that H > Id /R 2 uniformly on [0, l] d , hence on M d . The function f ZjX 
converges uniformly to tt z on B(0, R) : 



lim \\f ZjX - iTz\\l°°(b(o,r)) = 0, 



A— >oo 



(6.86) 



since ir z is the m-th term in the Taylor development of / at z. Since Bh{z) C B(0, R) we 
therefore obtain 



lim X m pe X 2 H (f; z\ 

A— »oo 



lim X m+ v inf \\f z ,\- v\\lp(b h ( z )) 

A-s>oo /ieP m _i 

inf \\tt z - h\\lp(b h (z)) 



< Ik 



z\\LP(B H (z)) 

< \B h (z)\p\\tt z \\ L c C{Bh{z)) 

= oj (det H(z))~*>\\ir z o H{z)~^\\ 

= oo\\tt z o M(z)~^ II 

< 

where u := |-B(0, 1)| denotes the volume of the unit euclidean ball. We used in the third 
line the proportionality relation H(z) = (det M(z))~ m p+ d M(z) and the fact that tt z is 
homogeneous of degree m, which imply 

mp 1 m i 

detH(z) = detM(z)™p+ d and ||7r 2 o H{z)~* \\ = {det M{z)) 2 ^p+ d )\\v: z o M(zY^\\. 
The convergence (6.86) is uniform over all z G [0, l] d , and therefore 

lim X mp e X 2 H (fY p = lim X mp [ y/det(X 2 H(z))e x , H (f; zf p dz 

A->oo * A^oo >/[0,l] <i 

= / y/ det H(z) ( lim A"^ + <W(/; dz 

J, Q1]d VA^oo / 



det H{z)dz. 



'[o,i] d 
oo p m(H). 



322 



Chapter 6. Approximation theory based on metrics 



Since m(X 2 H) = X d m(H) we therefore obtain injecting (6.84) 



lim m{X 2 H)^e X 2 H {f)p < com(H) 



d p 



C\\K(d m f)\\ L r m]d) + 



which concludes the proof. 



6.5.5 The shape function is equivalent to a continuous function 



The shape functions K, L a and L g are defined in £6.1.5 by an infimum, which may 
or may not be attained, over the collection SL^ of all matrices of unit determinant. One 
can infer from this property that the shape functions are upper-semi continuous, but they 



are not continuous in general as illustrated in Remark 2.4.7 (in Chapter 2) for the shape 
function K when m = 4 and d = 2. 

The purpose of this subsection is to establish the following theorem, which states that 
the shape functions are nevertheless uniformly equivalent to a continuous function on H m . 

Theorem 6.5.14. There exists a constant C = C(m,d) and three continuous functions 
K c , L c a and L c g on M m such that for all ir G M m 

C^K^n) < K(n) < CK c (n) 
C- l LlM < LJn) < CL°M 



C-^tt) < L 9 (n) < CL c g 



7T 



Note that in the case of the shape functions K and L a , in dimension d = 2, this 
theorem is a consequence of the explicit algebraic equivalents of these shape functions 
given in Chapters 2 and 3. 

The analysis presented below applies to all three shape functions, without restriction 



on the dimension d. As observed in £6.5.3 the shape functions L a and L g are uniformly 
equivalent on H m to the functions L* a and L* respectively, defined for all tt e E m by 

L* a (n) := inf vW^W|f, (6-87) 
:= inf {{A' 1 \\ \\n o A\\, (6.88) 

AfcbL d 

where G(vr) := |V7r| 2 G H 2? n-2- Let us also recall that 

K(ir) := inf ||ttoA||. (6.89) 

AeSL d 

The fact that L* is uniformly equivalent to a continuous function therefore directly follows 
from the same property for the shape function K (in degree 2m — 2 instead of m), by 
composition with the continuous functions G : H m — > H 2m _ 2 and yf- : R + — > IR+. 

The next lemma gives a criterion on the lower semi-continuous envelope of a function, 
which guarantees that it is equivalent to a continuous function. 
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Lemma 6.5.15. Let (X,dx) be an arbitrary metric space, and let f : X — > R + . We 

define the lower semi continuous envelope f of f as follows : for all x G X 

f(x) := lim inf f(y). 

— e^Oy£B(x,e) 

Assume that f is upper semi- continuous and that there exists a constant C such that 

f(x) < Cf{x) or f(x) = (6.90) 

for all x G X. Then there exists a continuous function f c G C°(X, R+) such that 

C~7 C < / < Cf c . (6.91) 

Proof: For each n G TL we define the set 

E n := {x G X ; f(x) > C n } } (6.92) 



which is closed since / is upper semi-continuous. For any x G X \ E n there exists a 
sequence (xk)k>o, %k G X \ E n (hence f{x^) < C), converging to x. Therefore 

f(x) <hminf/(x fc ) < C. 

— k— >oo 



Combining this with (6.90) we obtain that x ^ E n+ i, hence the closed sets E n+ \ and 
X\E n are disjoint. We denote by (r n ) ne ^, r n G C°(X, [0, 1]), a family of continuous 
functions such that 

r n \E n+1 = 1 and r n{ ^ = 0. 
The simplest construction of such functions is the following : 

<ix(a;,X\E n ) 

r n (x) :-- 



d x (x,X \ E n ) + <ix(a:,-E„+i)' 

where c?x(^, -E) := inf{<ix(^, e) ; e G -E}. We define a function r : X — > [— oo, oo) (observe 
that — cxd is included) by the sum 

r(x) := y^(r n (a:) - 1) + ^r n (x). 

n<0 n>0 

For each n G 5Z the function r is equal to n + r n + r n+ i on the open set mt(E n \ E n+ 2). 
Furthermore 

E n+1 \ E n+2 G{xeX; f(x) < C n+2 and f_{x) > C n } C mt(E n \ E n+2 ), 
therefore r is continuous on the reunion 

|J mt(E n \ E n+2 ) D |J E n+1 \ E n+2 = {x G X ; f{x) > 0}. 

Consider x G X such that f(x) = 0, hence r(x) = — oo. Since / is upper semi-continuous 
there exists for each n G TL a constant 5 > such that / < C n on -B(x, 5). It follows that 
r < n on _B(x, 5), hence that r is continuous at x, in the topology of [— oo, oo). 
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The function r is therefore continuous on the whole space X. Since C > 1 the function 
f c : X ->■ M + defined by 

/ c ( x ) : = C"^, 

is therefore also continuous. We have f c (x) = if and only if r(x) = — oo, which is 
equivalent to f(x) = 0. Furthermore for each n G 2Z and each x £ E n \ E n +i we have 



C n < f(x) < C n+1 and C n < f c (x) < C n+1 by construction, which establishes (6.91) and 



concludes the proof of this lemma. o 

The proofs that the functions K and L* are uniformly equivalent on H m to a continuous 
function are extremely similar. The case of L* is nevertheless slightly more involved due 



to the term ||v4 _1 || appearing in (6.88) which is not present in (6.89). We therefore focus 
our attention on L*, and we leave to the reader the details of the adaptation of the proof 
to the shape function K. 

We define an auxiliary function £ on M d x H m as follows : for all (B,ir) G M d x !H m 

£(5,7r):= inf || A -1 ^ || ||tt o A\\, (6.93) 

A&SL d 

hence £(Id, tt) = L g (n) for all n G M m . 

The function £ is upper semi continuous since it is defined as the infimum of a family 
of continuous functions. Hence for any converging sequence (B n , n n ) — > (B, tt) in Md x H m 
we have 

£(-£>? tt) > limsup £(B n , n n ). 

n—toc 

The next lemma shows that this inequality becomes an equality if the limit is zero. 

Lemma 6.5.16. For any converging sequence (B n , ir n ) — > (B, it) in M d x M m the following 
holds : 

if lim £(B n , Ti n ) = then £(5, tt) = 0. 

n— toe 

Proof: If £(£>„, 7T n ) —> then there exists a sequence (A n ) n >o, A n G SL d , such that 

lim WA^BnWhnoAnW = 0. (6.94) 



For each n we consider the principal value decomposition A n = U n D n V n , where U n , V n G 
O d and D n is a diagonal matrix satisfying detD n = 1, with positive diagonal coefficients 
t n = (tn,i, ■ ■ ■ ,t n ,d)- We have for any n > 0, since V n is orthogonal, 

WA^BnWW^oAnW = \\^D- 1 T%B n \\\\>Ko{U n D n V n )\\ 

= \\D- l UjBn\\ho{UnD n )\\. 



Since the collection Od of orthogonal matrices is compact, the sequence of orthogonal 



matrices U n admits a converging sub-sequence U^ui) ~ > U. We define B' := U T B, tt' :- 



7r o U, and we observe that since | det U\ = 1 

C(B,tv) = C(B',7r'). (6.95) 
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We also define for each n 

B' n '■= U^ n )B^ n ), ^' n := r K t p{n)°U v {n), and D' n := D v{n)l 
in such way that 

lim^XHOBV) and lim \\D'- l B' n \\ ||< o D' n \\ = 0. (6.96) 



We thus recognize our starting point (6.94) except that the matrices A n are replaced with 
diagonal matrices D' n , of coefficients 

In order to avoid notational clutter we do not keep track of the sub-sequence extraction, 
and until the end of this proof we write tt, tt h , B, B n , D n , t n and t n ^ for the variables tt' , 
<, B> 7 B' n , D' n1 t' n and t'^. 

We denote by / C {1, • • • , d} the collection of indices i such that the associated line 
of B is nonzero. There exists a constant c > such that for all n sufficiently large 

WD-'BJ >cmax^. (6.97) 

We denote by A the collection of exponents 

A := {a G TL% ; \a\ < m} 
and by n a , 7r n Q , the coefficients of tt and 7r„ respectively : 

tt = 2_,'R a Z a and 7r n . = 7r ra;a Z a . 



It follows from (6.96) and (6.97) that for any a G A we have 

lim ( maxt-n 7T na t£ = 0. (6.98) 

For each t — (ti, • - • , t^) G IR+ we define 

A(t) := ja G A ; ^maxtr 1 ^ t a > 1 

The set A has only a finite number of subsets, since it is finite. Hence there exists an 
extraction ip such that A* := A(t^( n )) does not depend on n. In view of (6.98) we thus 
have for all ct£ A, 



7T„ = lim TT.hrn.Yr, = 0. 



n->-oo 



Defining D : = A/,(o), and denoting by t = (t\, ■ ■ ■ ,td), its diagonal coefficients we thus 
obtain 



lim ||Zr n £||||7ro.D n || = 0. 

n— ¥00 
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Indeed \\D- n B\\ < Cmax ie; r n and 



(m&xtrA n a t na = ir a (t a maxt; 1 ^ 



which equals if a G A*, and tends to zero if a G A \ A*. It follows that £(B, 77) = as 
announced. o 

The next lemma compares the function £ with its lower semi-continuous envelope. 
Lemma 6.5.17. There exists a constant C such that £ < C£ on Md X M m . 
Proof: We denote by A the following compact subset of Md x H m 
A := {(£,71-) G M d x M m ■ \\B\\ = ||7r|| = 1 and ||tt o A|| > 1 for all A G SL d }. 



It follows from the definition (6.93) of £ that £(B,tv) = 1 for all (B,tt) G A. Therefore 



£ does not vanish on A according to Lemma 6.5.16 Since £ is lower semi continuous, it 
attains its minimum on A, we thus define 

C- 1 :=inf£. 

A ~ 

Let {B,tt) G Ma x H m , and let (A n ) n > , A n G SL^, be a sequence such that 

lim IIAr^llllTro An\\ = £(B,ir) := inf ||tt o A\\. (6.99) 

n— >oo AeSL d 

If £(-B, 77 ) = 0, then £(£, 77) = and there is nothing to prove, we may therefore assume 
that £(-B, 77) > 0. We have for each n > 



£ 



A~ X B 77oA„ \ £(5 



,77 



A- l B\V Ik o A, II ) \\A- X B\\\W o A r 



which tends to £(B, 77) /£ (B, 77) as ti 4 00. Consider an extraction such that the 
following quantities converge 

lim _, = B* and lim - = 77*. 

n^oo \\A^ n ,B\\ n^oo \\n O A^ n )\\ 

We now prove that (5*, 77*) G A. We first remark that ||£>*|| = |k*|| = 1 by construc- 
tion. Assume for contradiction that there exists A G GL^ such that HA -1 !?*!! Ik* o A|| = 
5 < 1. We obtain 

lim \\{A v(n) Ay l B\\\\K o (A v(n) A)\\ 

n-»oo 

I^^H^H ||(77 ^ {ri) )oA| 



„ v yw ^ nv- -w; _^N || A -i n ,B||||7roA. 
— 11^)^11 lko^ (n) || 11 <*0 1111 * 

<5£(-B, 77) 
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which contradicts the definition (6.93) of £, hence (5*,7r*) G A as announced. We thus 
obtain, since £ is lower semi continuous, 



lim £ 



A n lB KoA n , 



If follows that C£(B, tt) > £(5,7r), which concludes the proof of this lemma. 



o 



We now conclude the proof that L* is equivalent to a continuous function. Applying 
Lemma 16.5.151 to the function £ and the constant C + 1, where C is the constant from 



Lemma 6.5.17 we obtain that that £ is equivalent to a continuous function on Md x Hi r , 



Recalling that L* (ic) = £(Id, tt) we obtain that L* is also equivalent to a continuous 



function, on H m , which concludes the proof of Theorem 6.5.14 
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Adaptive multiresolution analysis 
based on anisotropic triangulations 
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7.1 Introduction 

Approximation by piecewise polynomial functions is a standard procedure which occurs 
in various applications. In some of them such as terrain data simplification or image 
compression, the function to be approximated might be fully known, while it might be 
only partially known or fully unknown in other applications such as denoising, statistical 
learning or in the finite element discretization of PDE's. 

In all these applications, one usually makes the distinction between uniform and adap- 
tive approximation. In the uniform case, the domain of interest is decomposed into a 
partition where all elements have comparable shape and size, while these attributes are 
allowed to vary strongly in the adaptive case. In the context of adaptive triangulations, 
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another important distinction is between isotropic and anisotropic triangulations. In the 
first case the triangles satisfy a condition which guarantees that they do not differ too 
much from equilateral triangles. This can either be stated in terms of a minimal value 
6q > for every angle, or by a uniform bound on the aspect ratio 

hr 

Pt ■= — 

r T 

of each triangle T where Ht and tt respectively denote the diameter of T and of its largest 
inscribed disc. In the second case, which is in the scope of the present chapter, the aspect 
ratio is allowed to be arbitrarily large, i.e. long and thin triangles are allowed. In summary, 
adaptive and anisotropic triangulations mean that we do not fix any constraint on the 
size and shape of the triangles. 

Given a function / and a norm || • ||x of interest, we can formulate the problem of 
finding the optimal triangulation for / in the X-norm in two related forms : 

- For a given N find a triangulation Tn with N triangles and a piecewise polynomial 
function f N (of some fixed degree) on Tn such that ||/ — /jy||x is minimized. 

- For a given e > find a triangulation Tn with minimal number of triangles N and 
a piecewise polynomial function f N such that ||/ — /n\\x < £■ 

In this chapter X will be the LP norm for some arbitrary 1 < p < oo. The exact solution 
to such problems is usually out of reach both analytically and algorithmically : even when 
restricting the search of the vertices to a finite grid, the number of possible triangulations 
has combinatorial complexity and an exhaustive search is therefore prohibitive. 

Concrete mesh generation algorithms have been developed in order to generate in 
reasonable time triangulations which are "close" to the above described optimal trade-off 
between error and complexity. They are typically governed by two intuitively desirable 
features : 

1. The triangulation should equidistribute the local approximation error between each 
triangle. This rationale is typically used in local mesh refinement algorithms for 



numerical PDE's 90 : a triangle is refined when the local approximation error 
(estimated by an a-posteriori error indicator) is large. 

2. In the case of anisotropic meshes, the local aspect ratio should in addition be op- 
timally adapted to the approximated function /. In the case of piecewise linear 
approximation, this is achieved by imposing that the triangles are isotropic with 
respect to a distorted metric induced by the Hessian d 2 f. We refer in particular 



to 16 where this task is executed using Dclaunay mesh generation techniques. 

While these last algorithms fastly produce anisotropic meshes which are naturally 
adapted to the approximated function, they suffer from two intrinsic limitations : 

1. They are based on the evaluation of the Hessian d 2 f, and therefore do not in principle 
apply to arbitrary functions / G L P (Q) for 1 < p < oo or to noisy data. 

2. They are non-hierarchical : for N > M, the triangulation Tn is not a refinement of 
7m- 

One way to circumvent the first limitation is to regularize the function /, either by 
projection onto a finite element space or by convolution by a mollifier. However this raises 
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the additional problem of appropriately tuning the amount of smoothing, in particular 
depending on the noise level in /. 

The need for hierarchical triangulations is critical in the construction of wavelet bases, 
which play an important role in applications to image and terrain data processing, in 



particular data compression 31 . In such applications, the multilevel structure is also of 
key use for the fast encoding of the information. Hierarchy is also useful in the design 
of optimally converging adaptive methods for PDE's [l2l|50l[74l|87]. However, all these 



developments are so far mostly restricted to isotropic refinement methods. Let us mention 



that hierarchical and anisotropic triangulations have been investigated in 64 , yet in this 
work the triangulations are fixed in advance and therefore generally not adapted to the 
approximated function. 

A natural objective is therefore to design adaptive algorithmic techniques that combine 
hierarchy and anisotropy, and that apply to any function f e L P (Q), without any need for 
regularization. 

In this chapter we propose and study a simple greedy refinement procedure that achieves 
this goal : starting from an initial triangulation V , the procedure bisects every triangle 
from one of its vertices to the mid-point of the opposite segment. The choice of the vertex 
is typically the one which minimizes the new approximation error after bisection among 
the three options. 

Surprisingly, it turns out that - in the case of piecewise linear approximation - this 
elementary strategy tends to generate anisotropic triangles with optimal aspect ratio. This 
fact is rigorously proved in Chapter [8] which establishes optimal error estimates for the 
approximation of smooth and convex functions f G C 2 , by adaptive triangulations T/v with 
N triangles. These triangulations are obtained by consecutively applying the refinement 
procedure to the triangle of maximal error. The estimates in Chapter [8] are of the form 

1 1 



< CiV-l V|det(d 2 /)Hk, - = - + 1, (7.1) 

r p 

and were already established in [4 27 for functions which are not necessarily convex, 
however based on triangulations which are non-hierarchical and based on the evaluation 
of d 2 f. Note that (7.1) improves on the estimate 



\\f-f N \\ L p <CN~ l \\d 2 f\\ L r, - = - + 1. (7.2) 

Tp 



which can be established, see £8.2 in the next chapter, for adaptive triangulations with 



isotropic triangles, and which itselfs improves on the classical estimate 

\\f-fN\\ L *<CN- l \\d 2 f\\ LV , (7.3) 

which is known to hold for uniform triangulations. 

The main objective of the present chapter is to introduce the refinement procedure 
as well as several approximation methods based on it, and to study their convergence for 



an arbitrary function f & L p . In £7.2 we introduce notation that serves to describe the 
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refinement procedure and define the anisotropic hierarchy of triangulations ('Dj)j>o- We 
show how this general framework can be used to derive adaptive approximations of / either 
by triangulations based on greedy or optimal trees, or by wavelet thresholding. In £7.3 we 
show that as defined, the approximations produced by the refinement procedure may fail 
to converge for certain / G LP and show how to modify the procedure so that convergence 
holds for any arbitrary / G L p . We finally present in t7_A some numerical tests which 
illustrate the optimal mesh adaptation, in the case of piecewise linear elements, when the 
refinement procedure is applied either to synthetic functions or to numerical images. 



7.2 An adaptive and anisotropic multiresolution fra- 
mework 

7.2.1 The refinement procedure 

Our refinement procedure is based on a local approximation operator At acting from 
L P (T) onto P m - the space of polynomials of total degree less or equal to m. Here, the 
parameters m > and 1 < p < oo are arbitrary but fixed. For a generic triangle T, we 
denote by (a, b, c) its edge vectors oriented in clockwise or anticlockwise direction so that 

a + b + c = 0. 

We define the local L p approximation error 

er(/) P := \\f - A T f\\v>{T). 

The most natural choice for At is the operator Bt of best L P (T) approximation which is 
defined by 

11/ - B T f\\LP(T) = min ||/ - vr|| L p (T) . 

However this operator is non-linear and not easy to compute when p ^ 2. In practice, one 
prefers to use an operator which is easier to compute, yet nearly optimal in the sense that 

11/ - A T f\\ LP{T) < C inf ||/ - tt|U p(t) , (7.4) 

with C a Lebesgue constant independent of / and T. Two particularly simple admissible 
choices of approximation operators are the following : 

1. At = Pt, the L 2 (T)-orthogonal projection onto P m , defined by Prf G P m such 
that j T (f — Pt/)*7T = for all 7r G P m . This operator has finite Lebesgue constant 
for all p, with C = 1 when p = 2 and C > 1 otherwise. 

2. At = It, the local interpolation operator which is defined by It/ G P m such that 
hfil) = f(j) for all 7 G S := {J2 ; ki G IN, J2 h = m} where {vi,v 2 ,v 3 } are 
the vertices of T (in the case m = we can take for £ the barycenter of T). This 
operator is only defined on continuous functions and has Lebesgue constant C > 1 
in the L°° norm. 
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All our results are simultaneously valid when At is either Pt or It (in the case where 
p = oo), or any linear operator that fulfills the continuity assumption ( |7.4[ ). 

Given a target function, our refinement procedure defines by induction a hierarchy 
of nested triangulations CPj)j>o with #(T>j) = 2^(V ). The procedure starts from the 
coarse triangulation T>q of Q, which is fixed independently of /. When Q = [0, l] 2 we may 
split it into two symmetric triangles so that #(T> ) = 2. For every T G TJj, we split T 
into two sub-triangles of equal area by bisection from one of its three vertices towards the 
mid-point of the opposite edge e G {a,b,c}. We denote by T], and T e 2 the two resulting 
triangles. The choice of e G {a, b, c} is made according to a refinement rule that selects 
this edge depending on the properties of /. We denote by 1Z this refinement rule, which 
can therefore be viewed as a mapping 

K:(f,T)^e. 

We thus obtain two children of T corresponding to the choice e. X^+i is the triangulation 
consisting of all such pairs corresponding to all T e Dj. 

In this chapter, we consider refinement rules where the selected edge e minimizes a 
decision function e h- >■ dy(e, /) among {a, b, c}. We refer to such rules as greedy refinement 
rules. A more elaborate type of refinement rule is also considered in §7.3.3 

The role of the decision function is to drive the generation of anisotropic triangles 
according to the local properties of /, in contrast to simpler procedures such as newest 
vertex bisection (i.e. split T from the most recently created vertex) which is independent 
of / and generates triangulations with isotropic shape constraint. 

Therefore, the choice of dr{e, f) is critical in order to obtain triangles with an optimal 
aspect ratio. The most natural choice corresponds to the optimal split 

d T (e,f)=e n (fy p + e Ti (fy p , (7.5) 

i.e. choose the edge that minimizes the resulting LP error after bisection. It is proved in 
Chapter [8] in the case of piecewise linear approximation, that when / is a C 2 function 
which is strictly convex or concave the refinement rule based on the decision function 

d T (e, f) = \\f- lTlf\W(TD + 11/ - InfWitW ( 7 - 6 ) 

generates triangles which tend to have have an optimal aspect ratio, locally adapted to 
the Hessian d 2 f. This aspect ratio is independent of the IP norm in which one wants to 
minimize the error between / and its piecewise affine approximation. 

Remark 7.2.1. If the minimizer e is not unique, we may choose it among the multiple 
minimizers either randomly or according to some prescribed ordering of the edges (for 
example the largest coordinate pair of the opposite vertex in lexicographical order). 

Remark 7.2.2. The triangulations T>j which are generated by the greedy procedure are in 
general non- conforming, i.e. exhibit hanging nodes. This is not problematic in the present 
setting since we consider approximation in the L p norm which does not require global 
continuity of the piecewise polynomial functions. 
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The refinement rule 7Z defines a multiresolution framework. For a given / e L P (Q) 
and any triangle T we denote by 

C(T) :={T\,T 2 }, 

the children of T which are the two triangles obtained by splitting T based on the pres- 
cribed decision function d T (e,f). We also say that T is the parent of Ti and T 2 and 
write 

T = 7>(Ti) = P(T 2 ). 

Note that 
We also define 

2? := Uj> V 3 , 

which has the structure of an infinite binary tree. Note that Vj depends on / (except for 
j = 0) and on the refinement rule 7Z, and thus V also depends on / and 1Z : 

Vj = Vj(f,K) and V = V(f,1Z). 

For notational simplicity we sometimes omit the dependence in / and 1Z when there is 
no possible ambiguity. 

7.2.2 Adaptive tree-based triangulations 

A first application of the multiresolution framework is the design of adaptive anisotro- 
pic triangulations Tn for piecewise polynomial approximation, by a greedy tree algorithm. 
For any finite sub-tree S C V, we denote by 

£{S) :={TeS s.t. C(T) i S} 

its leaves which form a partition of Q. We also denote by 

X{S) :=S\C(S), 

its inner nodes. Note that any finite partition of Q by elements of V is the set of leaves 
of a finite sub-tree. One easily checks that 

#(5) = 2#(£(S)) - N . 

For each N, the greedy tree algorithm defines a finite sub-tree Sn of T> which grows 
from Sn '■= T^o — Tn , by adding to iSat_i the two children of the triangle T^_ 1 which 
maximizes the local IP-error er{f) p over all triangles in 7jv-i- 

The adaptive partition Tn associated with the greedy algorithm is defined by 

Tn '■= £(Sn)- 

Similarly to V, the triangulation Tn depends on / and on the refinement rule 7Z, but also 
on p and on the choice of the approximation operator At- We denote by fx the piecewise 
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polynomial approximation to / which is defined as Arf on each T £ Tn- The global L p 
approximation error is thus given by 

11/ - In\\lp = \\(e T (f)p)\\£p(T N )- 

Stopping criterions for the algorithm can be defined in various ways : 

- Number of triangles : stop once a prescribed N is attained. 

- Local error : stop once er{f)p < e for all T G Tn, for some prescribed e > 0. 

- Global error : stop once ||/ — /jv||ip < £ for some prescribed e > 0. 

Remark 7.2.3. T7ie role of the triangle selection based on the largest eT(f) p is to equi- 
distribute the local LP error, a feature which is desirable when we want to approximate f 
in L P (Q) with the smallest number of triangles. However, it should be well understood that 
the refinement rule may still be chosen based on a decision function defined by approxima- 
tion errors in norms that differ from LP . Ln particular, as explained earlier, the decision 



function (7.6) generates triangles which tend to have have an optimal aspect ratio, locally 
adapted to the Hessian d 2 f when f is strictly convex or concave, and this aspect ratio is 
independent of the L p norm in which one wants to minimize the error between f and its 
piecewise affine approximation. 

The greedy algorithm is one particular way of deriving an adaptive triangulation for 
/ within the multiresolution framework defined by the infinite tree T>. An interesting 
alternative is to build adaptive triangulations within T> which offer an optimal trade- 
off between error and complexity. This can be done when 1 < p < oo by solving the 
minimization problem 



mm 

s 



{ E M/)£ + A#(S)} (7.7) 



Te£(S) 



among all finite trees, for some fixed A > 0. In this approach, we do not directly control the 
number of triangles which depends on the penalty parameter A. However, it is immediate 
to see that if N = N(X) is the cardinality of T N = C(S*) where S* is the minimizing tree, 
then T N minimizes the LP approximation error 



T N := argmin^e T (/)^ 



#{T)<N — r 



where the minimum is taken among all partitions T of Q within T> of cardinality less than 
or equal to N. 



Due to the additive structure of the error term, the minimization problem (7.7) can 



be performed in fast computational time using an optimal pruning algorithm of CART 



type, see 17,49 . In the case p = oo the associated minimization problem 

eT(/)oo + A#(5)|, (7.8) 

i ) 



mm< sup 



s I 



reC(S) 



can also be solved by a similar fast algorithm. It is obvious that this method improves 
over the greedy tree algorithm : if iV is the cardinality of the triangulation resulting from 
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the minimization in (7.7) and /jy the corresponding piecewise polynomial approximation 
of / associated with this triangulation, we have 

||/-&IUp<||/-/*IU>, 

where /jv is built by the greedy tree algorithm. 



7.2.3 Anisotropic wavelets 

The multiresolution framework allows us to introduce the piecewise polynomial mul- 
tiresolution spaces 

Vj = Vj{f,K) := {g s.t. g\ T e P m , T E V,}, 

which depend on / and on the refinement rule 1Z. These spaces are nested and we denote 
by 

v = v(f,n) = u J > v J (f,n), 

their union. For notational simplicity, we sometimes omit the dependence in / and 1Z 
when there is no possible ambiguity. 

The Vj spaces may be used to construct wavelet bases, following the approach intro- 
duced in [I] and that we describe in our present setting. 

The space Vj is equipped with an orthonormal scaling function basis : 

<Pt, i = l,— ,^(m + l)(m + 2), T 6 Vj, 

where the tp l T for i = 1, • • • , |(m + l)(m + 2) are supported in T and constitute an 
orthonormal basis of P m in the sense of L 2 (T) for each T 6D. There are several possible 
choices for such a basis. In the particular case where m = 1, a simple one is to take for T 
with vertices (vi, i>2, V3), 

(p^vA = \T\- 1/2 V3 and ^(vj) = -\T\- 1/2 Vz, j ^ i. 

We denote by Pj the orthogonal projection onto Vj : 

We next introduce for each T G T>j a set of wavelets 

*Pt, i = I,--- ,i(m+l)(m + 2), 

which constitutes an orthonormal basis of the orthogonal complement of P m (T) into 
P m (T') © P m (T") with {T',T"} the children of T. In the particular case where m = 1, 
a simple choice for such a basis is as follows : if (^1,^2,^3) and (wi,W2,w 3 ) denote the 
vertices of T' and T", with the convention that v\ = w% and v% = W2 denote the common 
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vertices, the second one being the midpoint of the segment (v 3 ,w 3 ) (i.e. T has vertices 
(v 3 ,w 3 ,vi)), then 

<1>t 



V2 ' 

<p} r l-<PT'~' fi T" +iP T" 
2 



where and <p l T „ are the above defined scaling functions. 
The family 

1 



■0 



Ti 



(m + l)(m + 2), Te re- 



constitutes an orthonormal basis of Wj, the L 2 -orthogonal complement of in V J " +1 . A 
multiscale orthonormal basis of Vj is given by 



2 (m+l)(m+2),- 
T£Vj,j=0,- ,J-1 



Letting J go to +00 we thus obtain that 



=l,-,|(m+l)(m+2), 



is an orthonormal basis of the space 



-L 2 {n) 



L 2 (n) 



V(f,K) 2 :=V(f,K) ' = U j > V j (f,K) 
For the sake of notational simplicity, we rewrite this basis as 

Note that V(f, 71) is not necessarily dense in L 2 (Q) and so V(f, 72.) 2 is not always equal to 
L 2 (Q). Therefore, the expansion of an arbitrary function g e L 2 (Q) in the above wavelet 
basis does not always converge towards g in L 2 (Q). The same remark holds for the L p 
convergence of the wavelet expansion of an arbitrary function g E L P (Q) (or C(Q) in the 
case p = 00) : //-convergence holds when the space 

-L p (n) 



V(f,K) p :=V(f,K) 
coincides with L P (Q) (or contains C(Q) in the case p = 00), since we have 



f -^2 dx ^ j> 



= \\f-P J f\\ LP <Cml\\f-g\\ LP , 

LP g&Vj 



with C the Lebesgue constant in (7.4) for the orthogonal projector. 



A sufficient condition for such a property to hold is obviously that the size of all 
triangles goes to as the level j increases, i.e. 

lim sup diam(T) = 0. 
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However, this condition might not hold for the hierarchy (J)j)j>o produced by the refine- 
ment procedure. On the other hand, the multiresolution approximation being intrinsically 
adapted to /, a more reasonable requirement is that the expansion of / converges towards 
/ in L p (Vl) when / G L p {Vl) (or C(Q) in the case p = oo). This is equivalent to the property 

feV(f,TZ) p . 

We may then define an adaptive approximation of / by thresholding its coefficients at 
some level e > : 

fe ■= 

\h\>e 

where f\ := (/, When measuring the error in the L p norm, a more natural choice is to 
perform thresholding on the component of the expansion measured in this norm, defining 
therefore 

fe ■= Yl /aV'A- 

II/a^aIIlp>£ 

We shall next see that the condition / e V(f,1Z) p also ensures the convergence of the 
tree-based adaptive approximations J'n and towards / in L P (Q). We shall also see 
that this condition may not hold for certain functions /, but that this difficulty can be 
circumvented by a modification of the refinement procedure. 

7.3 Convergence analysis 

7.3.1 A convergence criterion 

The following result relates the convergence towards / of its approximations by pro- 
jection onto the spaces Vj(f,7t), greedy and optimal tree algorithms, and wavelet thre- 
sholding. This result is valid for any refinement rule 1Z. 

Theorem 7.3.1. Let 1Z : (/, T) \-± e be an arbitrary refinement rule and let f e L P {VL). 
The following statements are equivalent : 

(i) feV(f,K) p . 

(ii) The greedy tree approximation converges : limjv_> +00 ||/ — /jv \\lp = 0. 

(iii) The optimal tree approximation converges : lim^v^+oo 11/ — /ivlU p = 0- 
In the case p = 2, they are also equivalent to : 

(iv) The thresholding approximation converges : lim e ^ 11/ — /elk 2 — 0. 

Proof : Clearly, (ii) implies (iii) since ||/— f^\\Lp < ||/— /jv||lp- Since the triangulation 
is a refinement of 7^, we also find that (iii) implies (i) as inf^gy^ ||/ — g\\ip < ||/ — /JvIIlp- 
We next show that (i) implies (ii). We first note that a consequence of (i) is that 

lim sup er(/) p = 0. 

It follows that for any rj > 0, there exists N(rj) such that for iV > N(rj), all triangles 
T E Tn satisfy 

e T (f) P < V- 
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On the other hand, (i) means that for all e > 0, there exists J = J{e) such that 

inf \\f ~9\\lp < e. 

For N > N(rj), we now split Tn into U where 

T+ := Tat n {UfrjVj) and 7^ := 7^ fl {U^jVj). 
We then estimate the error of the greedy algorithm by 

\\f-f N \\ p LP = E e ^)p+ E e ^ 

TcT+ 

< CMnf \\f-g\\ p LP + v p #(W 

< C p e p + 2 J N oV p , 



where C is the stability constant of (7.4). This implies (ii) since for any 5 > 0, we can first 
choose e > such that C p e p < 5/2, and then choose rj > such that 2 J ^N rj p < 5/2. 
When p = oo the estimate is modified into 

11/ - /n\\l«> < max-tCV,?]}, 

which also implies (ii) by a similar reasoning. 

We finally prove the equivalence between (i) and (iv) when p = 2. Property (i) is 
equivalent to the L 2 convergence of the orthogonal projection Pjf to / as j — > +oo, or 
equivalently of the partial sum 

E-^A 

where |A| stands for the scale level of the wavelet Since (V , a)asa is an orthonormal 
basis of V(f, 72.) 2, the summability and limit of J2\eA /a^a do not depend on the order of 
the terms. Therefore (i) is equivalent to the convergence of f e to /. o 

Remark 7.3.2. The equivalence between statements (i) and (iv) can be extended to 1 < 
p < 00 by showing that (^a)asa is an L p -unconditional system. Recall that this property 
means that there exists an absolute constant C > such that for any finitely supported 
sequences (c\) and (d\) such \c\\ < \d\\ for all \, one has 

11 E ca ^ a " lp - ^11 E^ a II lp - 

A consequence of this property is that if f G LP can be expressed as the LP limit 

/ = lim E /aV'a, 

|A|<i 

then any rearrangement of the series J^/aV'a converges towards f in L p . This easily 
implies the equivalence between (i) and (iv). The fact that (VoOasa is an unconditional 
system is well known for Haar systems Wyj which correspond to the case m = 0, and can 
be extended to m > in a straightforward manner. 



342 



Chapter 7. Adaptive and anisotropic multiresolution analysis 



7.3.2 A case of non-convergence 

We now show that if we use a greedy refinement rule 1Z based on a decision function ei- 
ther based on the interpolation or L 2 projection error after bisection, there exists functions 
/ G L P {VL) such that / ^ V(f,1Z) p . Without loss of generality, it is enough to construct 
/ on the reference triangle T re f of vertices {(0, 0), (1, 0), (1, 1)}, since our construction can 
be adapted to any triangle by an affine change of variables. 

Consider first a decision function defined from the interpolation error after bisection, 



such as (7.6), or more generally 



d T (f, 



11/ 



%/IIlj>(ti) 



Let / be a continuous function which is not identically on T re f and which vanishes at all 
points (x, y) such that x = for k = 0, 1, • • • , 2m, where m is the degree of polynomial 
approximation. For such an /, it is easy to see that I?f = and that Ij"f = for all 
subtriangles X" obtained by one bisection of T re f. This shows that there is no preferred 
bisection. Assuming that we bisect from the vertex (0, 0) to the opposite mid-point (1, |), 
we find that a similar situation occurs when splitting the two subtriangles. Iterating this 
observation, we see that an admissible choice of bisections leads after j steps to a triangula- 
tion Vj of T ref consisting of the triangles T Jjfc with vertices {(0, 0), (1, 2 _J fc), (1, 2~ J (A; + 1)) } 
with k — 0, - • • ,23 — 1. On each of these triangles / is interpolated by the null function and 
therefore by (7.4) the best L°° approximation in Vj does not converge to / as j — > +oo, 
i.e. / ^ V(f,TZ) 00 . It can also easily be checked that / ^ V(f,lZ) p . 

Similar counter-examples can be constructed when the decision function is defined 
from the L 2 projection error, and has the form 



MM ■= \\f-p T} f\\ p L 



LP{T}) 



\\f-PTif\\ P LP{T ,y 



Here, we describe such a construction in the case m — 1. We define / on 1Z as a function 
of the first variable given by 



f(x,y) 



u(x 



if x G 



0, 



f(x,y) 



U X 



if x G ( -,1 



where u is a non-trivial function in L 2 ([0, |]) such that 



u(x)dx 



xu(x)dx —lx u(x)dx = 0. 
Jo 



A possible choice is u{x) = L 3 (4x — 1) = 160x 3 — 120x 2 + 24x— 1 where L 3 is the Legendre 
polynomial of degree 3 defined on [—1, 1]. With this choice, we have the following result. 

Lemma 7.3.3. Let T be any triangle such that its vertices have x coordinates either 
(0, |, 1) or (0, 1, 1) or (|, 1, 1). Then f is orthogonal to Pi in L 2 (T). 



Proof : Define 

T := <^ (x,y) G T, x G 



and Ti :- 



[x,y) G T, x G ( -,1 
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Then, with v(x,y) being either the function 1 or x or y, we have 

f T f(x,y)v(x,y)dxdy = f T u(x)v(x,y)dxdy + f Ti u(x)v(x,y)dxdy 

= J Q 5 u(x)q (x)dx + J 5 u(x)q 1 (x)dx, 

with 

q (x) := / v(x,y)dy, qi(x) := v(x,y)dy, 

where T ijX = {y : (x,y) G Tj} for i = 0, 1. The functions go an d qi are polynomials of 
degree at most 2 and we thus obtain from the properties of u that f T fv = 0. o 

The above lemma shows that for any of the three possible choices of bisection of T re f 
based on the L 2 decision function, the error is left unchanged since the projection of / on 
all possible sub-triangle is 0. There is therefore no preferred choice, and assuming that we 
bisect from the vertex (0,0) to the opposite mid-point (1, |), then we see that a similar 
situation occurs when splitting the two subtriangles. The rest of the arguments showing 
that / ^ V(f, TZ) P are the same as in the previous counter-example. 

The above two examples of non-convergence reflect the fact that when / has some 
oscillations, the refinement procedure cannot determine the most appropriate bisection. 
In order to circumvent this difficulty one needs to modify the refinement rule. 



7.3.3 A modified refinement rule 

Our modification consists of bisecting from the most recently generated vertex of T, 
in case the local error is not reduced enough by all three bisections. More precisely, we 
modify the choice of the bisection of any T as follows : 

Let e be the edge which minimizes the decision function d<r(e, /). If 

{eT } {f)l + e Ti {f)l) l,P <9e T {fl 

we bisect T towards the edge e (greedy bisection). Otherwise, we bisect T from its most 
recently generated vertex (newest vertex bisection). Here 9 is a fixed number in (0, 1). In 
the case p = oo we use the condition 

max{e r i(/),e I? (/)}<0e r (/). 

This new refinement rule benefits from the mesh size reduction properties of newest vertex 



bisection. Indeed, a property illustrated in Figure 7.1 is that a sequence {BNN} of one 
arbitrary bisection (B) followed by two newest vertex bisections (N) produces triangles 
with diameters bounded by half the diameter of the initial triangle. A more general pro- 
perty - which proof is elementary yet tedious - is the following : a sequence of the type 
{BNB ■ ■ ■ BN} of length k + 2 with a newest vertex bisection at iteration 2 and k + 2 
produces triangles with diameter bounded by (1 — 2~ k ) times the diameter of the initial 



triangle, the worst case being illustrated in Figure 7.2 with k = 3 



Our next result shows that the modified algorithm now converges for any / e IP . 
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Figure 7.1 - Diameter reduction by a {BNN} sequence 




Figure 7.2 - Diameter reduction by a {BNBBN} sequence (the dark triangle has dia- 
meter at most 7/8 times the initial diameter) 



Theorem 7.3.4. With 1Z defined as the modified bisection rule, we have 

feV{f,K) p , 

for any f G L P (Q) ( or C(Vt) when p = oo). 

Proof : We first give the proof when p < oo. For each triangle T G Dj with j > 1, we 
introduce the two quantities 

« T : = mP , TTyJ and p(T) := — -, , 

eT{f)p + e T'(f)p ep(r)(/)p 

where T' is the "brother" of T, i.e. C(V(T)) = {T,T ; }. When a greedy bisection occurs in 
the split of V(T), we have 0(T) < 9 P . When a newest vertex bisection occurs, we have 



infTrep,,, ||/ — k\\lp(v(t)) 



where C is the constant of (7.4). 

We now consider a given level index j > and the triangulation T>j . For each T eDj, 
we consider the chain of nested triangles (T n ) 3 n=0 with Tj = T and T n _x = V(T n ), n = 
j, j — 1, ■ • ■ , 1. We define 



a(T) = l[ a(T n ) and 0(T) = f[ (3(T n ) 



n=l n=l 

It is easy to see that 

«(T)/3(T) = ^| 
eT (/)p 

so that 

e T {f) p p < C a(T)P(T), 
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with Co := maxT eD e T (f)p- It is also easy to check by induction on j that 
£a(T)= E = ■ • • = ^ «(T) = #P . 

We denote by /j the approximation to / in Vj defined by fj = Arf on all T G Tj so that 

TdV 3 

In order to prove that fj converges to / in L p , it is sufficient to show that the sequence 

Ej := maxmin{/3(T), diam(T)}, 

tends to as j grows. Indeed, if this holds, we split Vj into two sets V^ and Vj over 
which 0(T) < Ej and diam(T) < Ej respectively. We can then write 



\\f-m* = E e T{fr P + e 



< C ^E«( T )+^E ^ H/-*CO 



< C #2> oei + C E in J 11/ " w 



p 

„> ••up cry 



where C is the constant of (7.4). Clearly the first term tends to and so does the second 
term by standard properties of LP spaces since the diameter of the triangles in Vj goes 
to 0. It thus remains to prove that 

lim Ej =0. 

j— >+oo 

Again we consider the chain (T n )^ =0 which terminates at T, and we associate to it a chain 
{q n Y~2o where q n = 1 or 2 if bisection of T n is greedy or newest vertex respectively. If r is 
the total number of 2 in the chain (q n ) we have 

P(t) < c w e p{j - r \ 



with C the constant in (7.4). Let a k > be a fixed number, large enough such that 

CQ k - 1 < 1. 

We thus have 

~/3(T) < (^C p 6 p ^ k ~ lS) ) r 6 p ^~ rkS) < p( j~ rk \ (7.9) 
We now denote by I the maximal number of disjoint sub-chains of the type 
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with Vj G {1,2} and of length q + 2 < 2k + 3 which can be extracted from (q n ) J n=0 . From 
the remarks on the diameter reduction properties of newest vertex bisection, we see that 



diam(T) < B(l - 2 



-2k\l 



with B := maxr og x) diam(To) a fixed constant. On the other hand, it is not difficult to 
check that 

r<3/ + 3 + ^^. (7.10) 

Indeed let a be the total number of 1 in the sequence (q n ) which are not preceeded by 
a 2, and let «j be the size of the series of 1 following the z-th occurence of 2 in (q n ) for 
i — 1, • • • , r. Note that some ol{ might be 0. Clearly we have 

j = a + ai H h a r + r. 

From the above equality, the number of i such that a« > 2k is less than ^ and therefore 
there is at least m > r — ^ indices {i , ■ ■ ■ ,i m -i} such that < 2k. Denoting by 
the position of the z-th occurence of 2 (so that /3i+\ = /3j + a» + 1), we now consider the 
disjoint sequences of indices 

&t = X^iztl 1 fiizt+2)l t = 0; 1; " ' ' 

There is at least y — 1 such sequences within {1, • • • , j} and by construction each of them 
contains a sequence of the type (z/ 1; 2, z/ 2 , z/ 3 , ■ ■ ■ ,u q ,2) with Uj G {1,2} and of length 
q + 2 < 2k + 3. Therefore the maximal number of disjoint sequences of such type satisfies 

1 7 — r 
I > -(r- J -—) - L 
- 3 V 2k ' 



which is equivalent to (7.10). Therefore according to (7.9) 

]3(T) < Q p ti~ rk } < 6)p(2~ 3 ( /+1 ) fc +i) 

If 3(Z + < J, we have 

P(t) < e v ~i. 

On the other hand, if 3(Z + l)k > |, we have 

diam(T) < 5(1 - 2- 2k )^-\ 
We therefore conclude that Ej goes to as j grows, which proves the result for p < oo. 

We briefly sketch the proof for p = oo, which is simpler. We now define (3(T) as 

er(/)oo 



/3(T) := 



ev(T)(f)c 



so that /3(T) < 9 if a greedy bisection occurs in the split of V(T). With the same definition 
of 0(T) we now have 

er(/)oo < C ^(T), 
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where Cq := maxT ex> ^T {f)oo- With the same definition of Sj and splitting of Vj, we now 
reach 

11/ - /ilU~ < max <^ C £j, C max inf ||/ - 7r||x,~(r) ? , 
I T623 7 7reP "i 

which again tends to if e 3 - tends to and / is continuous. The proof that Sj tends to 
as j grows is then similar to the case p < oo. o 

Remark 7.3.5. The choice of the parameter 9 < 1 deserves some attention : if it is 
chosen too small, then most bisections are of type N and we end up with an isotropic 
triangulation. In the case m = 1, a proper choice can be found by observing that when 
f has C 2 smoothness, it can be locally approximated by a quadratic polynomial q e IP2 
with er(/)p ~ Gr{q)p when the triangle T is small enough. For such quadratic functions 
q G P2, one can explicitely study the minimal error reduction which is always ensured by 
the greedy refinement rule defined a given decision function. In the particular case p = 2 
and with the choice At = Pt which is considered in the numerical experiments of §7-4 
explicit formulas for the error — PtQ\\l 2 (t) can be obtained by formal computing and can 
be used to prove a guaranteed error reduction by a factor 9* = | . It is therefore natural to 
choose 6 such that 9* < 9 < 1 (for example 9 — |j which ensures that bisections of type 
N only occur in the early steps of the algorithm, when the function still exhibits too many 
oscillations on some triangles. 



7.4 Numerical illustrations 

The following numerical experiments were conducted with piecewise linear approxi- 
mation in the I? setting : we use the L 2 -based decision function 

d T (f,e) :=\\f- P Ti f\\h m) + 11/ - PT?f\\h m , 

and we take for At the L 2 (T)-orthogonal projection Pt onto Pi. In these experiments, the 
function / is either a quadratic polynomial or a function with a simple analytic expression 
which allows us to compute the quantities ex(f)2 an d c?T(e, /) without any quadrature 
error, or a numerical image in which case the computation of these quantities is discretized 
on the pixel grid. 



7.4.1 Quadratic functions 

Our first goal is to illustrate numerically the optimal adaptation properties of the 
refinement procedure in terms of triangle shape. For this purpose, we take / = q a 
quadratic form i.e. an homogeneous polynomial of degree 2. In this case, all triangles 
should have the same aspect ratio since the Hessian is constant. In order to measure the 
quality of the shape of a triangle T in relation to q, we introduce the following quantity : 
if (a, b, c) are the edge vectors of T, we define 

m _ max{|q(q)|, |q(6)|, |q(c)|} 
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Figure 7.3 - V 8 for q(x,y) : = x 2 + lOOy 2 (left) and q(x,y) := x 2 - 10y 2 (right). 



where det(q) is the determinant of the 2x2 symmetric matrix Q associated with q, i.e. 
such that 

q(«) = (Qu,u) 

for all u G 1R 2 . Using the reference triangle and an affine change of variables, it is proved 



in of the next chapter that 

e T (q) p ~|T| 1+ ^ q (r) v /|det(q)| ! 

with equivalence constants independent of q and T. Therefore, if T is a triangle of given 
area, its shape should be designed in order to minimize p q (T). 

In the case where q is positive definite or negative definite, p q (T) takes small values 
when T is isotropic with respect to the metric \ {x, y)| q := ^/|q(a;, y)\, the minimal value 
being attained for an equilateral triangle for this metric. Specifically, we choose q(x, y) : = 



x 2 + lOCk/ 2 and display in Figure 7.3 (left) the triangulation V$ obtained after j = 8 
iterations of the refinement procedure, starting with a triangle which is equilateral for the 
euclidean metric (and therefore not adapted to q). Triangles such that p q (T) < 4 a/3 (at 
most 3 times the minimal value) are displayed in white, others in grey. We observe that 
most triangles produced by the refinement procedure are of the first type and therefore 
have a good aspect ratio. 



The case of a quadratic function of mixed signature is illustrated in Figure 7^3] (right) 
with q(x, y) := x 2 — 10y 2 . For such quadratic functions, triangles which are isotropic with 
respect to the metric | • || q | have a low value of p q , where |q| denotes the positive quadratic 
form associated to the absolute value \Q\ of the symmetric matrix Q associated to q. 
Recall for any symmetric matrix Q there exists Ax, A2 £ M and a rotation R such that 



and the absolute value \Q\ is defined as 



\Q\ = R L [ 'n 1 ' )R- 



|A 2 , 

In the present case, R = I and |q|(rr, = x 2 + 10y 2 . 
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Figure 7.4 - The function g s , for 5 = 0.02, 0.03, 0.07, 0.2. 



But one can also check that p q is left invariant by any linear transformation with 
eigenvalues (t, V) for any t > and eigenvectors (u,v) such that q(u) = q(i>) = 0, 
i.e. belonging to the null cone of q. More precisely, for any such transformation %p and 
any triangle T, one has p q (^(T)) = p q (T). In our example we have u = (vlO, 1) and 
v = (\/l0, —1))- Therefore long and thin triangles which are aligned with these vectors 
also have a low value of p q . Triangles T such that pi q i(T) < 4 a/3 are displayed in white, 
those such that p q (T) < 4 a/3 while P| q |(T) > 4 a/3 - i.e. adapted to q but not to |q| - 
are displayed in grey, and the others in dark. We observe that all the triangles triangles 
produced by the refinement procedure except one are either of the first or second type 
and therefore have a good aspect ratio. These empirical observations will be rigorously 
justified in §9.1 of Chapter 9. 



7.4.2 Sharp transition 

We next study the adaptive triangulations produced by the greedy tree algorithm for 
a function / displaying a sharp transition along a curved edge. Specifically we take 

f(x,y) = fs(x,y) := g s (\/x 2 + y 2 ), 

where gs is defined by gs{r) = for < r < 1, g$(l + 5 + r) = _Mizll_ f or r > 0, 
and gs is a polynomial of degree 5 on [1, 1 + 5] which is determined by imposing that 
gs is globally C 2 . The parameter 5 therefore measures the sharpness of the transition as 



illustrated in Figure 7.4 It can be shown that the Hessian of fs is negative definite for 



a/x 2 + y 2 < 1 + 5/2, and of mixed type for 1 + 5/2 < ^Jx 2 + y 2 < 2 + 5. 



Figure [775] displays the triangulation 7Ioooo obtained after 10000 steps of the algorithm 
for 5 = 0.2. In particular, triangles T such that p q (T) < 4 - where q is the quadratic form 
associated with d 2 f measured at the barycenter of T - are displayed in white, others in 
grey. As expected, most triangles are of the first type and therefore well adapted to /. 
We also display on this figure the adaptive isotropic triangulation produced by the greedy 
tree algorithm based on newest vertex bisection for the same number of triangles. 

Since / is a C 2 function, approximations by uniform, adaptive isotropic and adaptive 
anisotropic triangulations all yield the convergence rate 0{N~ X ). However the constant 

C := limsup N\\f - /jv||z,a, 

JV->+oo 

strongly differs depending on the algorithm and on the sharpness of the transition, as 
illustrated in the table below. We denote by Cjj, Cj and Ca the empirical constants 
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(estimated by N\\f — /jv||2 for A" = 8192) in the uniform, adaptive isotropic and adaptive 



\d 2 f\\ L *, IW 



d 2 f\\im and A(f) 



anisotropic case respectively, and by U(f) : 

|| y/\det(d 2 f)\ || L 2/3 the theoretical constants suggested by (7.3), (7.2) and (7.1 ). We observe 
that Cu and Cj grow in a similar way as U(f) and /(/) as S — >• (a detailed computation 
shows that U(f) « 10.37 5~ 3/2 and 1(f) « 14.01 <5~ 1/2 ). In contrast CU and remain 
uniformly bounded, a fact which reflects the superiority of the anisotropic mesh as the 
layer becomes thinner. 



5 


u(f) 


1(f) 


Mf) 


Cu 


C7 


c A 


0.2 


103 


27 


6.75 


7.87 


1.78 


0.74 


0.1 


602 


60 


8.50 


23.7 


2.98 


0.92 


0.05 


1705 


82 


8.48 


65.5 


4.13 


0.92 


0.02 


3670 


105 


8.47 


200 


6.60 


0.92 



7.4.3 Numerical images 

We finally apply the greedy tree algorithm to numerical images. In this case the data / 
has the form of a discrete array of pixels, and the L 2 (T)-orthogonal projection is replaced 
by the £ 2 (SV)-orthogonal projection, where St is the set of pixels with centers contained 
in T. The approximated 512 x 512 image is displayed in Figure [7^6 which also shows its 
approximation by the greedy tree algorithm based on newest vertex bisection with 
iV = 2000 triangles. The systematic use of isotropic triangles results in strong ringing 
artifacts near the edges. 



We display in Figure 7.7 the result of the same algorithm now based on our greedy 
bisection procedure with the same number of triangles. As expected, the edges are better 
approximated due to the presence of well oriented anisotropic triangles. Yet artifacts 
persist on certain edges due to oscillatory features in the image which tend to mislead the 



algorithm in its search for triangles with good aspect ratio, as explained in £ 7.3.2 These 
artifacts tend to disappear if we use the modified refinement rule proposed in §7.3.3 as also 
illustrated on Figure Wj\ This modification is thus useful in the practical application of the 
algorithm, in addition of being necessary for proving convergence of the approximations 
towards any LP function. Note that encoding a triangulation resulting from iterations 
of the anisotropic refinement algorithm is more costly than for the newest vertex rule : 
the algorithm encounters at most 2A^ triangles and for each of them, one needs to encode 



Numerical illustrations 




Figure 7.7 - /2000 with greedy bisection (left), modified procedure (rig! 
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one out of four options (bisect towards edge a or b or c or not bisect), therefore resulting 
into AN bits, while only two options need to be encoded when using the newest vertex 
rule (bisect or not), therefore resulting into 2N bits. In the perspective of applications to 
image compression, another issue is the quantization and encoding of the piecewise affine 
function as well as the treatment of the triangular visual artifacts that are inherent to the 
use of discontinuous piecewise polynomials on triangulated domains. These issues will be 
discussed in a further work specifically dealing with image applications. 



7.5 Conclusions and perspectives 

In this chapter, we have studied a simple greedy refinement procedure which generates 
triangles that tend to have an optimal aspect ratio. This fact is rigorously proved in Chap- 



ter |8| together with the optimal convergence estimate (7.1 ) for the adaptive triangulations 
constructed by the greedy tree algorithm in the case where the approximated function / 
is C 2 and convex. Our numerical results illustrate these properties. 

In the present chapter we also show that for a general / G LP the refinement procedure 
can be misled by oscillations in /, and that this drawback may be circumvented by a simple 
modification of the refinement procedure. This modification appears to be useful in image 
processing applications, as shown by our numerical results. 

Let us finally mention several perspectives that are raised from our work, and that are 
the object of current investigation : 

1. Conforming triangulations : our algorithm inherently generates hanging nodes, which 
might not be desirable in certain applications, such as the numerical discretization 
of PDE's where anisotropic elements are sometimes used (2]. When using the greedy 
tree algorithm, an obvious way of avoiding this phenomenon is to bisect the chosen 
triangle together with an adjacent triangle in order to preserve conformity. Howe- 
ver, it is no more clear that this strategy generates optimal triangulations. In fact, 
we observed that many inappropriately oriented triangles can be generated by this 
approach. An alternative strategy is to apply the non-conforming greedy tree al- 
gorithm until a prescribed accuracy is met, followed by an additional refinement 
procedure in order to remove hanging nodes. 

2. Discretization and encoding : our work is in part motivated by applications to image 
and terrain data processing and compression. In such applications the data to be 
approximated is usually given in discrete form (pixels or point clouds) and the 
algorithm can be adapted to such data, as shown in our numerical image examples. 
Key issues which need to be dealt with are then (i) the efficient encoding of the 
approximations and of the triangulations using the tree structure in a similar spirit 
as in [31] and (ii) the removal of the triangular visual artifacts due to discontinuous 
piecewise polynomial approximation by an appropriate post-processing step. 

3. Adaptation to curved edges : one of the motivation for the use of anisotropic trian- 
gulations is the approximation of functions with jump discontinuities along an edge. 
For simple functions, such as characteristic functions of domains with smooth boun- 
daries, the L p -error rate with an optimally adapted triangulation of N elements is 
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2 , 



known to be 0(N~p). This rate reflects an 0(1) error concentrated on a strip of area 
0(N~ 2 ) separating the curved edge from a polygonal line. Our first investigations in 
this direction indicate that the greedy tree algorithm based on our refinement pro- 
cedure cannot achieve this rate, due to the fact that bisection does not offer enough 
geometrical adaptation. This is in contrast with other splitting procedures, such as 
in 40 in which the direction of the new cutting edge is optimized within an infinite 



range of possible choices, or 48 where the number of choices grows together with 
the resolution level. An interesting question is thus to understand if the optimal rate 
for edges can be achieved by a splitting procedure with a small and fixed number 
of choices similar to our refinement procedure, which would be beneficial from both 
a computational and encoding viewpoint. This question is addressed, and partially 
answered, in §9.3 
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8.1 Introduction 

In finite element approximation, a classical and important distinction is made between 
uniform and adaptive methods. In the first case all the elements which constitute the mesh 
have comparable shape and size, while these attributes are allowed to vary strongly in the 
second case. An important feature of adaptive methods is the fact that the mesh is not 
fixed in advance but rather tailored to the properties of the function / to be approximated. 
Since the function approximating / is not picked from a fixed linear space, adaptive finite 
elements can be considered as an instance of non-linear approximation. Other instances 
include approximation by rational functions, or by iV-term linear combinations of a basis 



or dictionary. We refer to 42 for a general survey on non-linear approximation. 

In this chapter, we focus our interest on piecewise linear finite element functions defined 
over triangulations of a bidimensional polygonal domain f2 C IR 2 . Given a triangulation 
T we denote by Vj- := {v s.t. v\t G Pi, T G T} the associated finite element space. The 
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norm in which we measure the approximation error is the L p norm for 1 < p < oo and 
we therefore do not require that the triangulations are conforming and that the functions 
of V-f are continuous between triangles. For a given function / we define 

ejv(/)i,p := inf mf \\f - g\\ LP , 

#(T)<Ng&V T 

the best approximation error of / when using at most N elements. In adaptive finite 
element approximation, critical questions are : 

1. Given a function / and a number N > 0, how can we characterize the optimal mesh 
for / with N elements corresponding to the above defined best approximation error. 

2. What quantitative estimates are available for the best approximation error ctv(/)lp ? 
Such estimates should involve the derivatives of / in a different way than for non- 
adaptive meshes. 

3. Can we build by a simple algorithmic procedure a mesh Tn of cardinality iV and a 
finite element function /jy e Vqr N such that ||/ — /n\\lp is comparable to e^{f)LP ? 

While the optimal mesh is usually difficult to characterize exactly, it should satisfy 
two intuitively desirable features : (i) the triangulation should equidistribute the local 
approximation error between each triangle and (ii) the aspect ratio of a triangle T should 
be isotropic with respect to a distorted metric induced by the local value of the hessian 
d 2 f on T (and therefore anisotropic in the sense of the euclidean metric). Under such 
prescriptions on the mesh, quantitative error estimates have recently been obtained in 
[4,27 when / is a C 2 function. These estimates are of the form 



]imsup Ne N (f) LP < C\\ y/\ det{d 2 f)\\\ L r, - = - + 1, (8.1) 

JV^oo T p 

where det(<i 2 /) is the determinant of the 2x2 hessian matrix. For a convex C 2 function / 



this estimate has been proved to be asymptotically optimal in 27 , in the following sense 



liminfiVe A r(/) L p > c\W\ det(d 2 /)IIU- (8-2) 

iV-»+oo 

The convexity assumption can actually be replaced by a mild assumption on the sequence 
of triangulations which is used for the approximation of / : a sequence (Tn)n>n is said 
to be admissible if #(TJv) < iV and 



sup ( N 1 / 2 max diam(T) ] < oo. 

N>N V ^eTjv / 



Then it is proved in Chapter [2j that for any admissible sequence and any C 2 function /, 
one has 

liminfiV inf ||/ - g\\ L „ > c\\ y/\ det(d 2 f)\\\ L r. (8.3) 

TV^+oo ge V Tn 

The admissibility assumption is not a severe limitation for an upper estimate of the error 
since it is also proved that for all e > 0, there exist an admissible sequence such that 



limsupiV inf \\f-g\\ LP <C\\^\det(dif)\\\ L r+e. (8.4) 

N^+oo 9& V Tn 
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We also refer to Chapter [2] for a generalization of such upper and lower estimates to higher 
order elements. 

From the computational viewpoint, a commonly used strategy for designing an optimal 
mesh consists therefore in evaluating the hessian d 2 f and imposing that each triangle of the 
mesh is isotropic with respect to a metric which is properly related to its local value. We 



refer in particular to 16 where this program is executed using Delaunay mesh generation 
techniques. While these algorithms fastly produce anisotropic meshes which are naturally 
adapted to the approximated function, they suffer from two intrinsic limitations : 

1. They use the data of d 2 f, and therefore do not apply to non-smooth or noisy func- 
tions. 

2. They are non-hierarchical : for N > M, the triangulation T/v is not a refinement of 

7m- 

In Chapter [7j an alternate strategy is proposed for the design of adaptive hierarchical 
meshes, based on a simple greedy algorithm : starting from an initial triangulation 7n , 
the algorithm picks the triangle T G T/v with the largest local LP error. This triangle is 
then bisected from the mid-point of one of its edges to the opposite vertex. The choice of 
the edge among the three options is the one that minimizes the new approximation error 
after bisection. The algorithm can be applied to any L p function, smooth or not, in the 
context of piecewise polynomial approximation of any given order. In the case of piecewise 
linear approximation, numerical experiments in Chapter [7] indicate that this elementary 
strategy generates triangles with an optimal aspect ratio and approximations fjy G Vj- N 



such that ||/ — f N || LP satisfies the same estimate as ejq{f)i P in (8.1). 

The goal of this chapter is to support these experimental observations by a rigorous 
analysis. This chapter is organized as follows : 

In §8.2[ we introduce notations which are used throughout the chapter and collect 
some available approximation theory results for piecewise linear finite elements, making 
the distinction between (i) uniform, (ii) adaptive isotropic and (iii) adaptive anisotropic 
triangulations. In the last case, which is in the scope of this chapter, we introduce a 
measure of non-degeneracy of a triangle T with respect to a quadratic form. We show 



that the optimal error estimate (8.1) is met when each triangle is non-degenerate in the 
sense of the above measure with respect to the quadratic form given by the local hessian 
d 2 f. We end by briefly recalling the greedy algorithm which is introduced in Chapter [7j 



In £8.3, we study the behavior of the refinement procedure when applied to a quadratic 
function q such that its associated quadratic form q is of positive or negative sign. A key 
observation is that the edge which is bisected is the longest with respect to the metric 
induced by q. This allows us to prove that the triangles generated by the refinement 
procedure adopt an optimal aspect ratio in the sense of the non-degeneracy measure 



introduced in §8.2 



In t8A, we study the behavior of the algorithm when applied to a general C 2 func- 
tion / which is assumed to be strictly convex (or strictly concave). We first establish a 
perturbation result, which shows that when / is locally close to a quadratic function q 
the algorithm behaves in a similar manner as when applied to q. We then prove that the 
diameters of the triangles produced by the algorithm tend to zero so that the perturbation 
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result can be applied. This allows us to show that the optimal convergence estimate 



UmsupiV||/-/ iV || L p < C\W\det{d 2 f)\\\ L r (8.5) 

iV-s>oo 



is met by the sequence of approximations € Vr N generated by the algorithm. 

The extension of this result to an arbitrary C 2 function / remains an open problem. 
It is possible to proceed to an analysis similar to £ 8.3| in the case where the quadratic 



form q is of mixed sign, also proving that the triangles adopt an optimal aspect ratio as 
they get refined. We describe this analysis in §9.1. However, it seems difficult to extend 



the perturbation analysis of §8.4| to this new setting. In particular the diameters of the 
triangles are no more ensured to tend to zero, and one can even exhibit examples of 
non-convex C 2 functions / for which the approximation fails to converge towards / 
due to this phenomenon. Such examples are discussed in Chapter [7] which also proposes 
a modification of the algorithm for which convergence is always ensured. However, we do 



not know if the optimal convergence estimate (8.5) holds for any / e C with this modified 



algorithm, although this seems plausible from the numerical experiments. 



8.2 Adaptive finite element approximation 

8.2.1 Notations 

We shall make use of a linear approximation operator At that maps continuous func- 
tions defined on T onto Pi. For an arbitrary but fixed 1 < p < oo, we define the local LP 
approximation error 

e T (f) P := \\f-A T f\\ LP{T) . 
The critical assumptions in our analysis for the operator At will be the following : 

1. At is continuous in the L°° norm. 

2. At commutes with affine changes of variables : At{}') ° 4> = ^-i(t)(/ ° 0) for all 
afline <fi. 

3. At reproduces Pi : At(j) = tt, for any n G Pi. 

Note that the commutation assumption implies that for any function / and any affine 
transformation : x i— >■ Xq + Lx we have 

e^cr)(/)p=|det(L)| 1 /f er (/o0) J ,, (8.6) 

Two particularly simple admissible choices of approximation operators are the following : 

- At = Pt, the L 2 (T)-orthogonal projection operator : J T (f — Pt/)^ = for all 
ttGPi. 

- At = It, the local interpolation operator : It f{vi) = f{vi) with {i>o, t>i, ^2} the 
vertices of T. 

All our results are simultaneously valid when At is either Pt or It, or any linear operator 
that fulfills the three above assumptions. 
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Given a function / and a triangulation 7jv with N = #(7jv), we can associate a 
finite element approximation f N defined on each T G 7jv by /at(x) = ^4^/(x). The global 
approximation error is given by 

ll/-^IU, = (^e T (/)^, 
with the usual modification when p = oo. 

Remark 8.2.1. T/ie operator Bt of best L P (T) approximation which is defined by 

11/ - Brf\\i*(X) = m L n IIZ-tII^t)' 

does noi /a// zn the above category of operators, since it is non-linear (and not easy to 
compute) when p ^ 2. However, it is clear that any estimate on \\f — /jv||lp with $n 
defined as Arf on each T implies a similar estimate when f^ is defined as Brf on each 
T. 

Here and throughout the chapter, when 

q(x, y) = a 2fi x 2 + 2a hl xy + a 0t2 y 2 + a lfi x + a 0jl y + a 0fi 
we denote by q the associated quadratic form : if u = (x, y) 

q(u) = a 2t0 x 2 + 2a X)1 xy + a 0)2 y 2 . 

Note that q(u) = (Qu, u) where Q=( fl2 '° ) . We define 

\ 0-1,1 a 0i2 / 

det(q) := det(Q). 
If q is a positive or negative quadratic form, we define the q-metric 

M, := V^M (8-7) 

which coincides with the euclidean norm when q(t>) = x 2 + y 2 for v = (x,y). If q is a 
quadratic form of mixed sign, we define the associated positive form |q| which corresponds 
to the symmetric matrix \Q\ that has same eigenvectors as Q with eigenvalues (|A|, if 
(A,/i) are the eigenvalues of Q. Note that generally |q|(w) ^ |q(u)| and that one always 
has |q(u)| < |q|(w). 

Remark 8.2.2. IfdetQ > 0, then there exists a 2 x 2 matrix L and e G {+1, —1} such 
that 

The linear change of coordinates <f>(u) := Lu, where u = (x,y) G 1R 2 , therefore satisfies 
q o (fi(u) = e{x 2 + y 2 ) . On the other hand, if det Q < then there exists a 2 x 2 matrix L 
such that 

Defining again (f>(u) := Lu we obtain in this case q o (p{u) = x 2 — y 2 . 
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8.2.2 From uniform to adaptive isotropic triangulations 

A standard estimate in finite element approximation states that if / G W 2 ' P (Q) then 

inf \\f-g\\L P <Ch 2 \\d 2 f\\ LP , 
gev h 

where Vh is the piecewise linear finite element space associated with a triangulation Th of 
mesh size h := max^er,, diam(T). If we restrict our attention to uniform triangulations, 
we have 

N := #(%) ~ h- 2 . 

Therefore, denoting by e^ lf (/)iP the L p approximation error by a uniform triangulation 
of cardinality N, we can re-express the above estimate as 

et\f)Lv<CN- l \\d 2 f\\ LP . (8.8) 

This estimate can be significantly improved when using adaptive partitions. We give here 
some heuristic arguments, which are based on the assumption that on each triangle T the 
relative variation of d 2 f is small so that it can be considered as constant over T (which 
means that / is replaced by a quadratic function on each T), and we also indicate the 
available results which are proved more rigorously. 

First consider isotropic triangulations, i.e. such that all triangles satisfy a uniform 
estimate 

Pt = — <A } (8.9) 
r T 

where hr '■= diam(T) denotes the size of the longest edge of T, and tt is the radius of the 
largest disc contained in T. In such a case we start from the local approximation estimate 
on any T 

e T (f) P <Ch 2 T \\d 2 f\\ LP{T) , 



and notice that 

l2 II j2 jMI \m\ II j2 xll II j2.fl, 

|£ r (T), 



h 2 T \\d 2 f\\ LnT) ~ |T| |M 2 /IU*>(T) = UdVlb 



with - := - + 1 and |T| the area of T, where we have used the isotropy assumption (8.9) 
in the equivalence and the fact that d 2 f is constant over T in the equality. It follows that 

eT(/) P <C||d 2 /ll^(T), -:=- + l. 

T p 

Assume now that we can construct adaptive isotropic triangulations 1~n with iV := #(T/v) 
which equidistributes the local error in the sense that for some prescribed e > 

ce<e T (f) p <e, (8.10) 

with c > a fixed constant independent of T and N. Then defining as Ar(f) on each 
T G Tn, we have on the one hand 

\\f - InWu < NV*e, 
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and on the other hand, with - := - + 1, 



N(csy < ]T ||/- f N \\l HT) < £ W d2 f\\ T mr) < C T \\d 2 f\\l T . 



TeT N 



TeT N 



Combining both, one obtains for e^°(/)i> := ||/ — /n\\lp the estimate 



(8.11) 



This estimate improves upon (8.8) since the rate N' 1 is now obtained with the weaker 
smoothness condition d 2 f G L T and since, even for smooth /, the quantity ||c? 2 /||l t might 
be significantly smaller than ||d 2 /||i>- This type of result is classical in non-linear ap- 
proximation and also occurs when we consider best iV-term approximation in a wavelet 
basis. 

The principle of error equidistribution suggests a simple greedy algorithm to build an 
adaptive isotropic triangulation for a given /, similar to our algorithm but where the 
bisection of the triangle T that maximizes the local error eT{f) P is systematically done 
from its most recently created vertex in order to preserve the estimate (8.9). Such an 



algorithm cannot exactly equilibrate the error in the sense of (8.10) and therefore does 



not lead to the same the optimal estimate as in (8.11 ). However, it was proved in 13 that 
it satisfies 

\\f-fN\\l*<C\f\B>N-\ 



for all r such that 



< 



1, provided that the local approximation operator At is 



bounded in the LP norm. Here B 2 T denotes the usual Besov space which is a natural 



substitute for W 2,T when r < 1. Therefore this estimate is not far from (8.11). 



8.2.3 Anisotropic triangulations : the optimal aspect ratio 

We now turn to anisotropic adaptive triangulations, and start by discussing the optimal 
shape of a triangle T for a given function / at a given point. For this purpose, we again 
replace / by a quadratic function assuming that d 2 f is constant over T. For such a q G P2 
and its associated quadratic form q, we first derive an equivalent quantity for the local 
approximation error. Here and as well as in £8^3 and §8.4[ we consider a triangle T and 



we denote by (a, 6, c) its edge vectors oriented in clockwise or anti-clockwise direction so 
that 

a + b + c = 0. 

Proposition 8.2.3. The local L p -approximation error satisfies 

eT(q) P = e T (q) P ~ |T|?max{|q(a)|, |q(6)|, |q(c)|}, 
where the constant in the equivalence is independent of q, T and p. 

Proof: The first equality is trivial since q and q differ by an affine function. Let T eq be 
an equilateral triangle of area |T eq | = 1, and edges a,b,c. Let E be the 3-dimensional 



362 



Chapter 8. Greedy bisection generates optimally adapted triangulations 



vector space of all quadratic forms. Then the following quantities are norms on E, and 
thus equivalent : 

e Tcq (q) p ~ max{|q(a)|, |q(6)|, |q(c)|}. (8.12) 

Note that the constants in this equivalence are independent of p since all L p (T eq ) norms 
are uniformly equivalent on E. 

If T is an arbitrary triangle, there exists an affine transform : x i— > x$ + Lx such that 



T = 0(T eq ). For any quadratic function q, we thus obtain from (8.6) 



e T (q) = e T (q) = e 0(Tcq) (g) = | det L\ pe Tcq (q o 0) = | det L\ "e Tcq (q o L) 



since q o L is the homogeneous part of q o 0. By (8.12), we thus have 



e T (q) ~ | detL|p max{|q(La)|, \q{Lb)\, \q(Lc)\}, 

where {a, b, c} are again the edge vectors of T eq . Remarking that \T\ = | det L\ and that 
{La, Lb, Lc} are the edge vectors of T, this concludes the proof of this proposition. o 

In order to describe the optimal shape of a triangle T for the quadratic function q, we fix 
the area of \T\ and try to minimize the error eT(<?) P or equivalently max{|q(a)|, |q(6)|, |q(c)|}. 
The solution to this problem can be found by introducing for any q such that det(q) ^ 
the following measure of non- degeneracy for T : 

(T) = max{|q(q)| , |q(6)|, |q(c)|} 

Let 4> be a linear change of variables, q a quadratic form and T a triangle of edges a, b, c. 
Then det(q o 0) = (det 0) 2 det (q), the edges of <f>(T) are 0(a), 0(6), 0(c) and |0(T)| = 
| det0||T|. Hence we obtain 

max{|qo0(q)|, |qo0(6)|, |qo0(c)|} 

Pq ° 0( j |T|V|det(qo0)| 

max{|q(0(a))|,|q(0(6))|,|q(0(c))|} (8.14) 



pMt))- 



det0||T|v/|det(q)| 



The last equation, combined with Remark 8.2.2 allows to reduce the study of Pq(T) to 
two elementary cases by change of variable : 

1. The case where det(q) > is reduced to q(x,y) = x 2 + y 2 . Recall that for any 
triangle T with edges a,b,c we define hx '■= diam(T) = max{|a|, |6|, |c|}, with | • | 

the euclidean norm. In this case we therefore have p q (T) = fe, which corresponds 
to a standard measure of shape regularity in the sense that its boundedness is 



equivalent to a property such as (|8.9|). This quantity is minimized when the triangle 

that the 



28 



T is equilateral, with minimal value ^= (in fact it was also proved in 
minimum of the interpolation error ||q— Lp q||jy>m among all triangles of area \T\ = 1 
is attained when T is equilateral). For a general quadratic form q of positive sign, 
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we obtain by change of variable that the minimal value ^ is obtained for triangles 
which are equilateral with respect to the metric | • | q . More generally triangles with 
a good aspect ratio, i.e. a small value of p q (T), are those which are isotropic with 
respect to this metric. Of course, a similar conclusion holds for a quadratic form of 
negative sign. 

The case where det(q) < is reduced to q(x, y) = x 2 — y 2 . In this case, the analysis 
presented in (26) shows that the quantity p q (T) is minimized when T is a half of 
a square with sides parallel to the x and y axes, with minimal value 2. But using 



(8.14) we also notice that p q (T) = p q (L(T)) for any linear transformation L such 
that q = qoL. This holds if L has eigenvalues (A, y), where A 7^ 0, and eigenvectors 
(1, 1) and (—1,1). Therefore, all images of the half square by such transformations 
L are also optimal triangles. Note that such triangles can be highly anisotropic. 
For a general quadratic form q of mixed sign, we notice that p q (T) < p| q |(T), and 
therefore triangles which are equilateral with respect to the metric | • || q | have a good 
aspect ratio, i.e. a small value of p q (T). In addition, by similar arguments, we find 
that all images of such triangles by linear transforms L with eigenvalues (A, \) and 
eigenvectors (u, v) such that q(u) = q(w) = also have a good aspect ratio, since 
q = q o L for such transforms. 

We leave aside the special case where det(q) = 0. In such a case, the triangles minimizing 
the error for a given area degenerate in the sense that they should be infinitely long and 
thin, aligned with the direction of the null eigenvalue of q. 

Summing up, we find that triangles with a good aspect ratio are characterized by the 



fact that p q (T) is small. In addition, from Proposition 8.2.3 and the definition of p q (T), 
we have 

e T (q) p ~ |T| 1+ V|det(q)|p q (T) = || y/\ det(q)||| L . (r) p q (T), l - := ± + 1. (8.15) 

We now return to a function / such that d 2 f is assumed to be constant on every T E Tn- 
Assuming that all triangles have a good aspect ratio in the sense that 

P q (T) < C 

for some fixed constant C and with q the value of d 2 f over T, we find up to a change in 
C that 

e T (f) P < C\W\det(dif)\\\ L , (T) (8.16) 
By a similar reasoning as with isotropic triangulations, we now obtain that if the trian- 



gulation equidistributes the error in the sense of (8.10) 



11/ - InWlp < CN-'Wl det(d 2 f)\\\ LT , (8.17) 



and therefore (8.1) holds. This estimate improves upon (8.11) since the quantity \\d 



might be significantly larger than ||y | det(<i 2 /) | , in particular when / has some aniso- 
tropic features, such as sharp gradients along curved edges. 

The above derivation of ( |8.1| ) is heuristic and non-rigorous. Clearly, this estimate 
cannot be valid as such since det(<i 2 /) may vanish while the approximation error does not 
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(consider for instance / depending only on a single variable). More rigorous versions were 
derived in [27] and pj. In these results \d 2 f\ is typically replaced by a majorant \d 2 f\ +sl, 



avoiding that its determinant vanishes. The estimate (8.1) can then be rigorously proved 
but holds for N > N(e, /) large enough. This limitation is unavoidable and reflects the 
fact that enough resolution is needed so that the hessian can be viewed as locally constant 
over each optimized triangle. Another formulation, which is rigorously proved in Chapter 
[2j reads as follows. 

Proposition 8.2.4. There exists an absolute constant C > such that for any polygonal 
domain Q and any function f G C 2 (Q), one has 



limsup Ne N {f) LP < C\\^\det(d 2 



N- 



8.2.4 The greedy algorithm 

Given a target function /, our algorithm iteratively builds triangulations TJv with N = 
#(T/v) an d finite element approximations f^. The starting point is a coarse triangulation 
7iVo- Given Tn, the algorithm selects the triangle T which maximizes the local error er(/)p 
among all triangles of Tn, and bisects it from the mid-point of one of its edges towards 
the opposite vertex. This gives the new triangulation Tn+i- 

The critical part of the algorithm lies in the choice of the edge e G {a, b, c} from which 
T is bisected. Denoting by T\ and T 2 the two resulting triangles, we choose e as the 
minimizer of a decision function dr{c,f), which role is to drive the generated triangles 
towards an optimal aspect ratio. While the most natural choice for dr(e, f) corresponds 
to the split that minimizes the error after bisection, namely 

d T (e,f)=e T} (fy p + e Ti (fJ p , 

we shall instead focus our attention on a decision function which is defined as the L 1 norm 
of the interpolation error 

dr(e, /) = 11/ - Its f\\i>Pi) + 11/ ~ M /Hnpf). (8-18) 

For this decision, the analysis of the algorithm is made simpler, due to the fact that we 
can derive explicit expressions of ||/ — It/Hl^t) when / = q is a quadratic polynomial 
with a positive homogeneous part q. We prove in §8.3| that this choice leads to triangles 



with an optimal aspect ratio in the sense of a small p q (T). This leads us in ^8.4 to a proof 



that the algorithm satisfies the optimal convergence estimate (8.17) in the case where / 
is C 2 and strictly convex. 

Remark 8.2.5. It should be well understood that while the decision function is based on 
the L l norm, the selection of the triangle to be bisected is done by maximizing ct(/) p - The 
algorithm remains therefore governed by the LP norm in which we wish to minimize the 
error \ f — /jy||p for a given number of triangles. Intuitively, this means that the LP -norm 
influences the size of the triangles which have to equidistribute the error, but not their 
optimal shape. 
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Remark 8.2.6. It was pointed out to us that the L 1 norm of the interpolation error to 
a suitable convex function is also used to improve the mesh in the context of moving grid 
techniques, see [29]. 

We define a variant of the decision function as follows 

D T (eJ) := ||/-I T /|| il(T) -d T ( e ,/). 

Note that DT{e,f) is the reduction of the L l interpolation error resulting from the bi- 
section of the edge e, and that the selected edge that minimizes c?t(-,/) is also the one 
that maximizes D(-,f). The function has a simple expression in the case where / is 
a convex function. 

Lemma 8.2.7. Let T be a triangle and let f be a convex function on T . Let e be an edge 
of T with endpoints z and z\ . Then 

^(e,/) = ^( /W ; /W -/(^)). (8.19) 

If in addition f has C 2 smoothness, we also have 
ITI f 1 

D T (e,f) = — / (d 2 f(z t )e, e) min{t, 1 - t}dt, where z t := (1 - t)z + tz x . (8.20) 
6 Jo 

Proof: Since / is convex, we have It f > / on T, hence 

Wf-lTf\\lHX) = jjfrf-f). 

Similarly I T i / > / on T\ and Ij-2 / > / on T e 2 , hence 



D T (eJ) = / I T /- / I T i / - / I T| /. 

Jt JtI JtI 

Let z 2 be the vertex of T opposite the edge e. Since the function / is convex, it follows 
the previous expression that Dx{e, /) is the volume of the tetrahedron of vertices 

z 0,x + z l,x z 0,y + z \,y £ ( z + z l 



f y — 2 — J J and ( Zi > x ' Zi >y> for i = °' 2 - 

where (z itX , z iyy ) are the coordinates of z%. Let u = zq — Z2 and v = z\ — z 2 . We thus have 
D T (e,f) = ||'det(M)| where 



f(z )-f(z 2 ) f( Zl )-f(z 2 ) f{^)-f(z 2 ) 

Subtracting the half of the first two columns to the third one we find that M has the 
same determinant as 

u x v x 

M := I u y Vy 

f(z )-f(z 2 ) f( Zl )-f(z 2 ) 
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Recalling that 2\T\ = \det(u,v) 



we therefore obtain (8.19). In order to establish (8.20), 

= 8 — 28%/2+Sx, where 



we observe that we have in the distribution sense 9^(min{t, 1— 1} + ] 
5 t is the one-dimensional Dirac function at a point t. Hence for any univariate function 
h G C 2 ([0,1]), we have 

[ h"{t) min{t, 1 - t}dt = h(0) - 2/i(l/2) + h(l). 
Jo 



Combining this result with (8.19) we obtain (8.20). 



8.3 Positive quadratic functions 

In this section, we study the algorithm when applied to a quadratic polynomial q such 
that det(q) > 0. We shall assume without loss of generality that q is positive definite, 
since all our results extend in a trivial manner to the negative definite case. 

Our first observation is that the refinement procedure based on the decision function 



(8.18) always selects for bisection the longest edge in the sense of the q-metric | • | q defined 



by (8.7). 



Lemma 8.3.1. An edge eofT maximizes Z?r(e, g) among all edges ofT if and only if it 
maximizes |e| q among all edges ofT. 

Proof: The hessian d 2 q is constant and for all eei 2 one has 

(d 2 qe, e) = 2q(e). 



If e is an edge of a triangle T, and if q is a convex quadratic function, equation (8.20) 
therefore gives 

D T (e, g) = M q ( e )^ min{t,l-t}dt=^\e\l. (8.21) 
This concludes the proof. 

It follows from this lemma that the longest edge of T in the sense of the q-metric is 
selected for bisection by the decision function. In the remainder of this section, we use 
this fact to prove that the refinement procedure produces triangles which tend to adopt 
an optimal aspect ratio in the sense that Pq(T) becomes small in an average sense. 

For this purpose, it is convenient to introduce a close variant to p q (T) : if T is a 
triangle with edges a,b,c, such that |a| q > |6| q > |c| q , we define 

a m ._ q(fe) + q(c) _ K + \c\l 

Using the inequalities |&| q + |c| q < 2|a|„ and |a|_ < 2(|6|_ + |c| q ), we obtain the equivalence 



(8.23) 
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Similar to p q , this quantity is invariant under a linear coordinate changes 0, in the sense 
that 

^qo0(T) = CX q (0(T)), 



From (8.15) and (8.23) we can relate cx q to the local approximation error. 



Proposition 8.3.2. There exists a constant Co, which depends only on the choice of At, 
such that for any triangle T, quadratic function q and exponent 1 < p < oo, the local 
L p - approximation error satisfies 

C - 1 e T (g) p < <7 q (T)||^deti|| L r (T) < C e T (q) p . (8.24) 

where - := - + 1. 

T p 

Our next result shows that cx q (T) is always reduced by the refinement procedure. 

Proposition 8.3.3. If T is a triangle with children T\ and T 2 obtained by the refinement 
procedure for the quadratic function q, then 

max{o- q (Ti),o- q (T 2 )} < cr q (T). 

Proof: Assuming that |a| q > |o| q > |c| q , we know that the edge a is cut and that 
the children have area \T\/2 and edges a/2,6, (c — 6)/2 and a/2, (b — c)/2,c (recall that 
a + b + c = 0). We then have 

2|T| v ^et^ ( T q (r j ) < q^) +q ^Z^ (8.25) 

2 (8-27) 
2|T|v/detq a q (T). (8.28) 

o 



2 

q(6) + q(c 



Remark 8.3.4. For any positive definite quadratic form q 7 the minimum of <r q is 1, as 

is easily seen using the inequality 

2|det(6,c)| < 2|6||c| < \b\ 2 + |c| 2 , 

in which we have equality if and only ifb,c& M? are orthogonal vectors of the same norm. 
When q is the euclidean metric, the triangle that minimizes <r q is thus the half square. 
This is consistent with the above result since it is the only triangle which is similar (i.e. 
identical up to a translation, a rotation and a dilation) to both of its children after one 
step of longest edge bisection. 



Remark 8.3.5. A result of similar nature was already proved in '821 longest edge bi- 
section has the effect that the minimal angle in any triangle after an arbitrary number of 
refinements is at most twice the minimal angle of the initial triangle. 
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Our next objective is to show that as we iterate the refinement process, the value of 
cr q (T) becomes bounded independently of q for almost all generated triangles. For this 
purpose we introduce the following notation : if T is a triangle with edges such that 
Hq > \b\q > l c lqj we denote by i>q[T) the subtriangle of T obtained after bisection of a 
which contains the smallest edge c. We first establish inequalities between the measures 
Oq and p q applied to T and ^ q (T). 

Proposition 8.3.6. Let T be a triangle, then 



frqtyqCT)) < -Pq(T) 

pMt)) < ^ 



16 



P\{T) 



(8.29) 
(8.30) 



Proof: We first prove (8.29). Obviously, ipq(T) contains one edge s G {a, 6, c} from T, 
and one half edge t G {§,§,§} from T. Therefore 



CTq(Vq(T)) < 



+ I* 



< 



Hq + 



012 
2 Iq 



4|</> q (T)| v /deTq~ ~ 2|T|v^eT^ 8 



Pq(^)- 



For the proof of (8.30), we restrict our attention to the case q = x + y , without loss 
of generality thanks to the invariance formula (8.14). Let T be a triangle with edges 
M > \b\ > |c|. If /i is the width of T in the direction perpendicular to a, then 



2\T\ 



2\a\ 
PqM 



The sub-triangle ^ q (T) of T has edges |, c, d where d 



and the angles at the ends 



of | are acute. Indeed 



1 
4 



(c,a/2) 

By Pythagora's theorem we thus find 



2 ) < and (d, a/2) = - (\c\ 2 - |6| 2 ) < 0. 



max{ 



WMd| 2 }< 



+ h 2 



\a\' 

T 



16 



PI(T) 



Dividing by the respective areas of T and ipq(T), we obtain the announced result. 



o 



Our next result shows that a significant reduction of a q occurs at least for one of the 
triangles obtained by three successive refinements, unless it has reached a small value of 
o-q. We use the notation ^J(T) := ^ q (^ q (T)) and ^\{T) := ^q(^q(^))- 

Proposition 8.3.7. Let T be a triangle such that o"q(^ q (T)) > 5. Then a^ip^T)) < 
0.69a q (T). 
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Proof: The monotonicity of <r q established in Proposition (8.3.3) implies that 

5 < Cq(V'qCO) < ffq(V?(T)) < <7q(^q(T)). 



Combining this with inequality (8.29) we obtain 



8 < min{p q (^(T)), p q (^ q (T)), p q (T)}. 
If a triangle S* obeys Pq(S) > 4, then 

H 1 + ^P)) £1 



and therefore p q (V , q(»S')) < Pq('S') according to inequality (8.30). We can apply this to 
5* = ipq(T) and S = T, therefore obtaining 



PMT)) < Pq(Vq(T)) < P q (T). 



(8.31) 



We now remark that inequality (8.30) is equivalent to (p q (S')— p q (^ q (S'))) 2 > p q (V'q(>S')) 2 " 
16, hence 



P*(S) > P*(MS)) + V /Pq^q(^)) 2 " 16 



(8.32) 



provided that p q (5*) > p q (^ q (S')). Applying this to S 1 = V'q(^) and recalling that p q (V' q (^)) > 
8 we obtain 

p q (^ q (T)) > 8 + V8 2 - 16 > 14.9. 



Applying again (8.32) to S = T we obtain 



p q (T) > 14.9 + V14.9 2 - 16 > 29.3. 



Using (8.30), it follows that 



P«(T)) i 



16 



16 



P(T) ~ 8 ^ ' P 2 M(T))J V Pq(^q(T)) 

Eventually, the inequalities ( |8.23 ) imply that 



1 + 



16 



< 0.171. 



2<7 q «(T)) < Pq«(T)) < 0.171p q (T) < 0.171(8a q (T)) 



which concludes the proof. 



o 



An immediate consequence of Propositions 8.3.3 and 8.3.7 is the following 



Corollary 8.3.8. If (Tj)? =1 are the eight children obtained from three successive refine- 
ment procedures from T for the function q, then 

- for all i, o-q(Ti) < cr q (T), 

- there exists i such that cr q (Tj) < 0.69cr q (T) or o" q (Tj) < 5. 
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We are now ready to prove that most triangles tend to adopt an optimal aspect ratio 
as one iterates the refinement procedure. 

Theorem 8.3.9. Let T be a triangle, and q a positive definite quadratic function. Let 
k = llL -1^o 69) 15 • ^ en o,fter n applications of the refinement procedure starting from T , 
at most Cn k 7 n / 3 of the 2 n generated triangles satisfy cr q (5) > 5 ; where C is an absolute 
constant. Therefore the proportion of such triangles tends exponentially fast to as n — > 
+oo. 

Proof: If we prove the proposition for n multiple of 3, then it will hold for all n (with 
a larger constant) since <r q decreases at each refinement step. We now assume that n = 
3m, and consider the octree with root T obtained by only considering the triangles of 
generation 3i for i — 0, • • • , n. 



According to Corollary |8.3.8[ for each node of this tree, one of its eight children either 
checks cr q < 5 or has its non-degeneracy measure diminished by a factor 9 := 0.69. We 
remark that if <r q is diminished at least k times on the path going from the root T to a 
leaf S, then cr q (5) < 5. As a consequence, the number N(m) of triangles S which are such 
that Gq(S) > 5 within the generation level n = 3m is bounded by the number of words in 
an eight letters alphabet {ax, • • • , a§} with length m and that use the letter as at most k 
times, namely 



N(m) < (7) ^ ^ CmkT 

1— n V / 



1=0 

which is the announced result. o 

The fact that most triangles tend to adopt an optimal aspect ratio as one iterates the 
refinement procedure is a first hint that the approximation error in the greedy algorithm 



might satisfy the estimate (8.1) corresponding to an optimal triangulation. The following 
result shows that this is indeed the case, when this algorithm is applied on a triangular 
domain Q to a quadratic function q with positive definite associated quadratic form q. 
The extension of this result to more general C 2 convex functions on polygonal domains 
requires a more involved analysis based on local perturbation arguments and is the object 
of the next section. 

Corollary 8.3.10. Let Q be a triangle, and let q be a quadratic function with positive 
definite associated quadratic form q. Let q^ be the approximant of q on Q obtained by the 



greedy algorithm for the LP metric, using the L 1 decision function (8.18). Then 

limsupiV||g - q N \\LP(n) < C\\ \/dct(q) 



where ^ = - + 1 and where the constant C depends only on on the choice of the approxi- 
mation operator At used in the definition of the approximant. 

Proof: For any triangle T, quadratic function q G P2, and exponent p, let 

e 'A<i)p '■= ini Ik - A\lp(t) 

7rePi 
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be the error of best approximation of q on T. Let To be a fixed triangle of area 1, then for 
any g G P 2 and 1 < p < oo one has 

e' To (q)i < e' To (q) p < e To (q) P < e T() (g)oo- 

Furthermore, e' To (-)i and er (-)oo are semi norms on the finite dimensional space P2 which 
vanish precisely on the same subspace of P2, namely Pi. Hence these semi-norms are 
equivalent. It follows that 



c e To {q) P < e' To {q) p < e To {q) 1 



(8.33) 



where Cq is independent g6P 2 and of p > 1. Using the invariance property (8.6) we find 
that (8.33) holds for any triangle T in place of T with the same constant Cq. We also 
define for any triangulation 7~, 

e r(f) P P ■= £ e T {f)l and e' T {fY p := £ e' T (f) p p , 



TeT 



TeT 



and we remark that Co er(q) p < e' T (q) p < er(q) P - For each n, we denote by T„ the 
triangulation of Q produced by n successive refinements based on the L 1 decision function 
(8.18) for the quadratic function q of interest (note that #(7^) = 2 n ). We also define 
T° := {TeT;; (T q (T) > 5}. T herefore cr q (T) < 5 if T ^ 7£, and on the other hand 
we know from Proposition 8.3.3 that cr q (T) < cr q (fi) for any T G ■ It follows from 
Proposition 8.3.2| that 



er«(<?) P < C [J2 TeT u((T n (T)\T\ Vdetq 



2" 



T \/detq, 



where Co is the constant in (8.24). According to Theorem 8.3.9, we know that 



lim 2-#CC) = 0. 

n— >+oo 



Hence 



lim sup 2 n er^(q) P < 5Co T a/ det q = 5Cq|| a/ det q 1 1 jjt 



(fi). 



We now denote by T£ the triangulation generated by the greedy procedure with stopping 
criterion based on the error n n := C 1 2~t ||\/det q||i/r(n)- It follows from (8.24) that for 
all T G 7^ with k < n, one has 

er(?) P > C _ V q (T) || a/ det q|| L r (T) > C _1 2 _ - || ^/det q|| L r (Q) > r] n , 



where we used that \T\ 



2 |0| and that the minimal value of <r q is 1. This shows that 



T g n is a refinement of Furthermore any triangle T G 7^ has at most 2 k ^ children in 
T^, where k{T) is the smallest integer such that 

n + k(T) / 

Vn>C 2 - o- q (T)||vdetq|| L r(n). 



372 Chapter 8. Greedy bisection generates optimally adapted triangulations 

Since \ < r < 1 we obtain 2 k ^ < 2^ < 2? C$cr q (T) < 4C$a q (T). Hence 
401) < ACl °«{T) < ACl (5 x T + MWCO = Ci2 n (l + 

where C\ = 20 Cq and e n — )■ as n — >• oo. If 7jv is the triangulation generated after N 
steps of the greedy algorithm, then there exists n > such that 7jv is a refinement of 
(hence a refinement of T") and 7^ 9 +1 is a refinement of 7/v- It follows that #(TJv) < 
#CC +1 )<C 1 2"+ 1 (l + 5 n+1 ), and 

coe TiV (g)p < e^^p < e' T u(q) p < e T u(q) p , 

where we have used the fact that e' T (f) p < e'^(f) p whenever T is a refinement of T. 
Eventually, 

C 10C C 
limsupiVe rjv (g)p < limsup — 2 n+1 (l + e n+1 )e T u (q) < —\\^/detq\\ L r {n) , 

N— >oo n— >oo Co Co 

which concludes the proof. o 



8.4 The case of strictly convex functions 

The goal of this section is to prove that the approximation error in the greedy algo- 
rithm applied to a C 2 function / satisfies the estimate (8.1) corresponding to an optimal 
triangulation. Our main result is so far limited to the case where / is strictly convex. 

Theorem 8.4.1. Let f E C 2 (U) be such that 

d 2 f(x) > ml, for all x E^l 

for some arbitrary but fixed m > independent of x. Let /jv be the approximant obtained 
by the greedy algorithm for the L p metric, using the L 1 decision function (8. IS). Then 

\imsupN\\f - f N \\ LP <C\\y/det(d 2 f)\\ L r, 



where 1 = - + 1 and where C is a constant independent of p, f and m. 



8.34) 



Equation (8.34) can be rephrased as follows : there exists a sequence £jv(/) such that 
£ N(f) — > as N — > oo and 

||/-/*IUp< (c\\^/fet(¥fj\\ L r +e N (f)^N-\ 



< 



Note also that since || ^Jdet(d 2 f) ||^t > 0, there exists N (f) such that ||/ — /jv||lp 
2C\\^det(d 2 f)\\ L rN- 1 for all N > N (f). It should be stressed hard that N (f) can 
be arbitrarily large depending on the function /. Intuitively, this means that when / 
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has very large hessian at certain point, it takes more iterations for the algorithm to 
generate triangles with a good aspect ratio. The extension of this result to strictly concave 
functions is immediate by a change of sign. Its extension to arbitrary C 2 functions is so 
far incomplete, as it is explained in the end of the introduction. The proof of Theorem 



8.4.1 uses the fact that a strictly convex C 2 function is locally close to a quadratic function 
with positive definite hessian, which allows us to exploit the results obtained in §8.3| for 
these particular functions. 

8.4.1 A perturbation result 

We consider a triangle T, a function / G C 2 (T), a convex quadratic function q and 
H > such that on T 

d 2 q<d 2 f<(l + fi)d 2 q. (8.35) 

It follows that det(d 2 q) < det(d 2 f) < det((l + //)d 2 g) = (l + /i) 2 det(cPg). Since det(d 2 q) = 
4det(q), we obtain 

2||V^tq|U-(D < || ^det d^f\\ L , {T) < 2(1 + fj)\\ y/fet^\\ LHT) . (8.36) 

The following Lemma shows how the local errors associated to / and q are close 

Proposition 8.4.2. The exists a constant C e > 0, depending only on the operator At 
such that 

(1 - C e fi)e T (q) p < e T (f) p < (1 + C ef x)e T (q) p . (8.37) 



Proof: It follows from inequality (8.35) that the functions / — q and (1 + u)q — f are 
convex, hence 

lT(/-?)-(/-g)>0 and I T ((l + /i )g-/)-((l + /i )g-/)>0 
on the triangle T. We therefore obtain 

< (l T f - f) - (l T q - q) < fi(l T q - q). 
There exists a constant Cq > depending only on At such that for any h 6 C°(T), 

e T (h) p < C \T\p\\h\\ L ^ {T ). 



Furthermore according to Proposition 8.2.3 there exists a constant C\ > depending only 
on At such that 



Hence 



|T| 1/p ||g - I T g|| L oo (T) < C 1 e T {q) p . 

\e T (f) P ~ e T (q) P \ < e T (f - q) p 

= e T {{l T f-f)-{l T q-q)) p 

< C |T|i||(I r / - /) - - g)|U~ (r) 

< Co|T|5||//(I T g- <?)||l°°(t) 

< C C 1 ne T (q) p 



374 



Chapter 8. Greedy bisection generates optimally adapted triangulations 



This concludes the proof of this Lemma, with C e = CqC\. 



Note that using Proposition 8.3.2, and assuming that // < c e := ^r, we have with 
i:=l + i, 

e T (f) P ~ e T (q) p ~ a q (T)|| ^ detq||| LT(r) ~ <r q (T) || ^det rf 2 /ll^(T), (8.38) 

with absolute constants in the equivalence. 

We next study the behavior of the decision function e t-» <ir(e, /). For this purpose, 
we introduce the following definition. 

Definition 8.4.3. Let T be a triangle with edges a,b,c. A 5-near longest edge bisection 
with respect to the q-metric is a bisection of any edge e 6 {a, b, c} such that 

q(e) >(l-5) max{q(a), q(6), q(c)} 



Proposition 8.4.4. Assume that f and q satisfy (8.35). Then, the bisection of T pres- 
cribed by the decision function e h-> dx{e, f) is a fi-near longest edge bisection for the 
q-metric. 



Proof: It follows directly from Equation (8.20) that for any edge e of T, 



D T {e,q) < D T {e,f) < D T {e,{l + fi)q), 



hence we obtain using (8.21) 



Hq(e) < D T (e, f) < (1 + /i)^q(e). (8.39) 

Therefore the bisection of T prescribed by the decision function e dr(e, f) selects an e 
such that 

(1 + /i)q(e) > max{q(a), q(6), q(c)}. 

It is therefore a 5-near longest edge bisection for the q-metric with 5 = < ti and 
therefore also a /z-near longest edge bisection. o 

In the rest of this section, we analyze the difference between a longest edge bisection in 
the q-metric and a 5-near longest edge bisection. For that purpose we introduce a distance 
between triangles : if Ti, T 2 are two triangles with edges a\, bi, C\ and a 2 , b 2 , c 2 such that 

q(ai) > q(6 x ) > q(ci) and q(a 2 ) > q(6 2 ) > q(c 2 ), (8.40) 

we define 

A q (Ti,T 2 ) = max{|q(ai) - q(a 2 )|, |q(6i) - q(6 2 )|, |q(ci) - q(c 2 )|}. 
Note that A q is a distance up to rigid transformations. 
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Lemma 8.4.5. Let Ti,T 2 be two triangles, let (R l7 U\) and (R 2 ,U 2 ) be the two pairs of 
children from the longest edge bisection of T\ in the q-metric, and a 5-near longest edge 
bisection ofT 2 in the q-metric. Then, up to a permutation of the pair of triangles (R\, U\), 

max{A q (i2i, J R 3 ),A q (?7i,?7 a )} < jA q (Ti, T 2 ) + 6q(a 2 ). 
where a 2 is the longest edge ofT 2 in the q-metric. 



Proof: We assume that the edges of T\ and T 2 are named and ordered as in (8.40). Up to 
a permutation, R\ and U\ have edge vectors b\, a\/2, (ci — b\)/2 and c%, ai/2, (61 — ci)/2. 
Two situations might occur for the pair (R 2 , U 2 ) : 

- q(e) < (1— <5)q(a 2 ) for e = b 2 and c 2 . In such a case the triangle T 2 is bisected towards 
a 2 , so that up to a permutation, R 2 and U 2 have edge vectors b 2 ,a 2 /2, (c 2 — b 2 )/2 
and c 2 , a 2 /2, \b 2 - c 2 ) /2. Using that q((c - b) /2) = q(c) /2 + q(6) /2 - q(a) /4 when 
a + fe + c = 0, it clearly follows that 

maxjAq^.^j.A^t/!,^)} < ^A q (T 1; T 2 ). 

" q( e ) > (1 — ^)q( a 2) for some e = 6 2 or c 2 . In such a case T 2 may be bisected say 
towards b 2 , so that up to a permutation, R 2 and C/ 2 have edge vectors a 2 , b 2 /2, (c 2 — 
a 2 )/2 and c 2 ,b 2 /2, (b 2 — c 2 )/2. But since |q(& 2 ) — q( fl 2)| < ^(02), we obtain that 

maxjAq^!, A q (C/ 1; U 2 )} < ^A q (Ti, T 2 ) + 5q(a 2 ). (8.41) 
We now introduce a perturbed version of the estimates describing the decay of the 



non- degeneracy measure which were obtained in Proposition 8.3.3 and Corollary 8.3.8 



Proposition 8.4.6. If(Ti)f =1 are the two children obtained from a refinement of a triangle 
T in which a 5-near longest edge bisection in the (^-metric is selected, then 

maxKCZl), (x q (T 2 )} < (1 + 45)a q (T). (8.42) 

1 are the eight children of a triangle T obtained from three successive refinements 
in which a 5-near longest edge bisection in the q-metric is selected, then 

- for all i, a q (T,) < a n (T)(l + C 2 5) , 

- there exists i such that cr q (Tj) < 0.69cr q (T)(l + C 2 5) or cx q (Tj) < M , 
where C 2 = f and M = 5(1 + C 2 5) . 



Proof: We first prove (8.42), and for that purpose we introduce the two children T[,T 2 



obtained by bisecting the longest edge of T in the q-metric. If follows from (8.41) that, 
up to a permutation of the pair (T[,T 2 ), 

max{A q (T!, T[), A q (T 2 , T 2 )} < 8q{a), 
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where a is the longest edge of T in the q-metric. Hence 



q(aj 



4|T i | x /det(q) 4|Ti|<y/det(q) 



<4&x q (T). 



(8.43) 



We know from Proposition 8.3.3 that max{cr q (T 1 / ), c^T^)} < cr q (T). Combining this point 



with (8.43) we conclude the proof of (8.42). 



We now turn to proof of the second part of the proposition and for that purpose we 
introduce the eight children (T/)f =1 obtained from three successive refinements of T in 



which the longest edge in the q-metric is selected. Iterating (8.41), we find that, up to a 
permutation of the triangles (T/)f =1 , one has 



61 

5q(a) = T-^q(a) 
16 



Co5 



q(a) 



where, again, a is the longest edge of T in the q-metric. Repeating the argument (8.43) 
we find that 

(8.44) 



max |a- q (Ti) - a q (7?)| < C 2 5a n (T). 

{ — 1,*" ,o 



We know from Corollary 8.3.8 that er q (T/) < cr q (T) for all i and that there exists i such 



that either cr q (T/) < 0.69cr q (T) or cr q (T/) < 5. Combining this point with (8.44) we 
conclude the proof of the proposition. o 



8.4.2 Local optimality 



Our next step towards the proof of Theorem 8.4.1 is to show that the triangulation 
produced by the greedy algorithm is locally optimal in the following sense : if the refine- 
ment procedure for the function / produces a triangle TeDon which / is close enough 
to a quadratic function q, then the triangles which are generated from the refinement of T 
tend to adopt an optimal aspect ratio in the q-metric, and a local version of the optimal 



estimate (8.1) holds on T 



We first prove that most triangles adopt an optimal aspect ratio as we iterate the 



refinement procedure. Our goal is thus to obtain a result similar to Theorem 8.3.9 which 
was restricted to quadratic functions. However, due to the perturbations by C 2 /i that 



appear in Proposition 8.4.6 the formulation will be slightly different, yet sufficient for our 
purposes : we shall prove that the measure of non-degeneracy becomes bounded by an 
absolute constant in an average sense, as we iterate the refinement procedure. 

As in the previous section, we assume that / and q satisfy (8.35). For any T 



we 



define T^{T) the triangulation of T which is built by iteratively applying the refinement 
procedure for the function / to all generated triangles up to 3n generation levels. Note 
that 

#(T„ M (T)) = 2 3n and \T'\ = 2- 3n \T\, T'eT^{T). 
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For r > 0, we define the average r-th power of the measure of non- degeneracy of the 2 3n 
triangles obtained from T after 3n iterations by 



7 (r, fj) := i (0.69(1 + C^J + 7 -(l + C 2f i)' 



We also define 



where C2 is the constant in Proposition 8.4.6[ Note that for any r > 0, the function 7(7", •) 
is continuous and increasing, and that < j(r,0) < 1. Hence for any r > 0, there exists 
fx(r) > and < 7(7*) < 1 such that 7(7", //) < 7(7*), if < // < /x(t'). 



Proposition 8.4.7. Assume that f and q satisfy (8.35) with < /i < /i(r). VKe i/ien /iai>e 

^H<<(T) 7 (rr+ - U ' 



KI-7W) 



where M is the constant in Proposition \8.4-o] Therefore 

M r 

Proof: Let us use the notations u = 0.69(1 + Gi\i) and v = (1 + C2//). According to 



Proposition 8.4.6 we have 



where E is the expectation operator and a n is the Markov chain with value in [l,+oo[ 
defined by 

- er n+ i = m&x{a n u, M} with probability a := |, 

- cr n+ i = cr n f with probability /3 := |, 

- cr := °"q(^o) with probability 1. 

Denoting by \i n the probability distribution of a n , we have 



n+l) 



/oo 
a r dfi n+1 (a) 



(a(max{-ucr, M}) r + /3(t><j) r ) dfi n (a) 



pM/u poo p+00 

aM r / d/j, n (a) + au r / a r du. n (cr) + /3v r / a r dfx n (a) 

Jl JM/u Jl 



< aM r + (au r + (3v r )E(a r n 

< «M r + 7 (r)E«) 



By iteration, it follows that 



E«) < EK) 7 (rr + 



IV) 



378 



Chapter 8. Greedy bisection generates optimally adapted triangulations 



which gives the result. 

Our next goal is to show that the greedy algorithm initialized from T generates a 
triangulation which is a refinement of T™(T) and therefore more accurate, yet with a 
similar amount of triangles. To this end, we apply the greedy algorithm with root T and 
stopping criterion given by the local error 

ri := min e T '(f) p - 

Therefore T' is splitted if and only if eT'(f) p > V- We denote by Tn(T) the resulting 
triangulation where N is its cardinality. From the definition of the stopping criterion, it 
is clear that Tn(T) is a refinement of 7^"(T). 



Proposition 8.4.8. Assume that f and q satisfy (8.35) with n < \, and define r : 



\s A-hia > 0. We then have 



N < C 4 2 3 X°H ? 

where C 4 is an absolute constant. Assuming in addition that fi < fi(r ) as in Proposition 



.4- 1\ we obtain that 



N < CV2 3n , 



z/2 3 " > 8a q (T) A with A := _ 3 ^ o) , and where C 5 = C 3 C 4 . 

Proof: Let T\ be a triangle in 7^"(T) and T 2 a triangle in T/v(T) such that T 2 C 7\. We 
shall give a bound on the number of splits k which were applied between Tj and T2, i.e. 



such that IT2I = 2 fc |Ti|. We first remark that according to Proposition 8.3.2 and (8.38), 
we have 

ri > c min \T'\ 1+ p<j a (T') -v/det q > c|Ti| 1+ p v det q, 
T'eT-(T) 



where c is an absolute constant. On the other hand, using both Proposition 8.4.2 and 



Proposition 8.4.6 we obtain 



e T2 (fU < C|T 2 | 1+ ^ q (T 2 )v/det^ 

= C|T 1 | 1+ h- fe(1+ ^ ) ( T q (T 2 ) v ^et^ 

< C\T 1 \ 1+1 ra ^ (T 1 )(2- (1+ l\l + A|2)y^/deJ^. 
<%(Ti)(^" 



< S^(n)(!)N 



k 

V 



where C is an absolute constant. Therefore we see that k is at most the smallest integer 
such that ^<j q (T 1 )(|) fc < 1. It follows that the total number n(T x ) of triangles T 2 G T/v(T) 
which are contained in T\ is bounded by 

n(T0 < 2 k < 2 — (r q (Ti 



c 
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and therefore 



N 



3n <°(n) 



with C A = 2 (f ) °. The fact that iV < C 5 2 3n when 2 3n > 8cx q (T) A with A 



immediate consequence of Proposition 8.4.7 



3r In 2 
- In 7(^o) 



is an 

o 



8.4.3 Optimal convergence estimates 



Our last step towards the proof of Theorem 8.4.1 consists in deriving local error esti- 
mates for the greedy algorithm. For rj > 0, we denote by f n the approximant to / obtained 
by the greedy algorithm with stopping criterion given by the local error 77 : a triangle T 
is splitted if and only if ex(f)p > rj. The resulting triangulation is denoted by 

T v = Tn, with N = N( V ) = #{%). 

For this N, we thus have f v = Jn- For a given T generated by the refinement procedure 
and such that r\ < er(/) p , we also define 

%(T) = {VcT;Ve %} 

the triangles in 7^ which are contained in T and 

N(T,rj) = #(%(?)). 

Our next result provides with estimates of the local error ||/ — f n \\LP(T) an d of N(T, rf) in 
terms of 77, provided that \i is small enough. 



Theorem 8.4.9. Assume that f and q satisfy (8.35) with \i < C2 := min{ |, /i(r )}, and 
that 1] < 7]o, where 



r/o = r) (T) := ^ - ^L j ^/detq, 



with X : = 3rol ? 2 , ; and 1 = 1 + 1. Then 

— m 7(ro) 7 t p 

\\f-f v \\L P[ T)<vN(T,ri)K (8.45) 

and 

iV(T,r/) < C 6 v- T Wtet(<Pf)\\ T mT), (8-46) 
where Cq is an absolute constant. 

Proof: The first estimate is trivial since 

\\f ~ M\LP(T) = ( £ *!*(/)?)'<( £ 77 P ^ * = T)N(T, rf)*. 
T'e%,(T) T'e%(T) 
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In the case p = oo, we trivially have 

11/ - /i,|U»(T) < v- 

For the second estimate, we define no = uq(T) the smallest positive integer such that 
2^o(T) > 8<r q (T) A with A := _ 3 ^°^ ) • For any fixed n > n , we define 

r/ n := min e T >{f) p . 
T'er»(T) 



We know from Proposition 8.4.8 that with the choice rj = r\ n 

N(T,n n )<C 5 2 3n . 



(8.47) 



8.4.7 



On the other hand, we know from Proposition 
follows that 

min aJT') < . 

T'eT n "(T) 



that crq (n) < C3, from which it 



According to Proposition 8.4.2 , we also have 

ri n < C min \T'\ 1+ paJT') Jdet q < C?C 
T'aT£{T) 

where C is an absolute constant, which also reads 



\T\ 

23" 



T a/ det q, 



Combining this with (8.47), we have obtained the estimate 



N{T, Vn ) < c 5 c;°c T v - T \T\^d^' 



which by Proposition 8.4.2 is equivalent to (8.46) with n = rj n . In order to obtain (8.46) 
for all arbitrary values of r), we write that r] n+ i < i] < n n for some n > no, then 

N (T, rj) <N(T, Vn+1 ) 

< C 5 2 3 ( n+1 ) 

< 8C 5 C 3 ^C^--|T| v ^et^ r 

< SCs^C^^Tlv^et 7 ^", 

which by Proposition 8.4.2| is equivalent to ( 8.46[ ). In the case where rj > r/ no , we simply 
write 

N(T, V ) <N(T,r, no ) 

< C 5 2 3no 

< 64C 5( x q (T) A 

= 64C 5 r] T \T\^/deJ^ T 

< QAC 5 r]- T \T\^detq T , 

and we conclude in the same way. 
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We remark that combining the estimates (8.45) and (8.46) in the above theorem yields 
the optimal local convergence estimate 

\\f-M\L*(T) < cl\W&et(<Pf)\\ L r (T) N(T^)-\ 



In order to obtain the global estimate of Theorem |8.4.1[ we need to be ensured that after 
sufficiently many steps of the greedy algorithm, the target / can be well approximated 
by quadratic function q = q{T) on each triangle T, so that our local results will apply on 
such triangles. This is ensured due to the following key result. 

Proposition 8.4.10. Let f be a C 2 function such that d 2 f(x) > ml for some arbitrary 
but fixed m > independent of x. Let Tn be the triangulation generated by the greedy 
algorithm applied to f using the L l decision function given by (8.18). Then 

lim max diam(T) = 0, 



i.e. the diameter of all triangles tends to 0. 

Proof: Let T be a triangle with an angle 9 at a vertex zq. The other vertices of T can be 
written as z\ = zq + au and z<i = zq + (3v where a, f3 G 1R + and u, v 6 M 2 are unitary. We 
assume that au is the longest edge of T, hence 9 < 7T/2. Observe that 



P(T) 



14 



or 



2a 



|T| \al3sm9 fism9 : 



and 



\u — v\ 



2 sin 



SU16 1 



2a 



cos 



/3p(T) cos( 



Since ^ < cos (|) we thus obtain 



. 2y/2a 

\U — V\ < — — 7=r < 



3a 



Pp(T) ~ (3p(TY 



We now set M :- 



M 2 /IU°°(n) and for all 5 > let 

uj(5):= sup \\d 2 f{z)-d 2 f{z')\\. 

z,z'&Q,\\z-z'\\<S 



For t G R, we define 



if" :— d 2 f ZQ+tu and H v t :— d 2 f ZQ +tv 



and notice that \\H^ — H^\\ < u(t\u — v\). Hence, if < t < (3, we have 



H;\\<u 



f 3a \ 
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Furthermore, for all t we have 

\(H? u,u) - (Hf v,v)\ = \(H? u,u) - (H? u- (u - v),u - (u - v))\ 

= \2(H? u,u-v) - (H? {u-v),u-v)\ 
< 2M\u\\u -v\ + M\u - v\ 2 

/ M I 9 3a/3 _,_ ( 3a V 
^ p I / p(T) + \p{T)) 



Applying the identity (8.20) to the edges e = au and (3v, and using a change of variable, 
we can write 

D T (au,f)= / min{t, a -t}+(H^u, u)dt and D T ((3v,f)= / min{£, (3 - t} + (H^ v, v)dt 
Jr Jr 

where we have used the notation r + := max{r, 0}. Hence, noticing that 

f X (^+) 2 
min{t, A — t} + dt = / min{t, A — t}dt = — - — , 



o 



and using the previous estimates we obtain 

D T {au,f) - D T {(3v,f) = I (min{t,a-t} + (H?u,u) - mm{t, - t}+(H? v,v)) dt 

Jr 

= / (min{£, a — t} + — min{t, (5 — t} + )(H^ w, u)dt 
Jr 

- [ mm{t,/3 -t} + ((H"v,v) - (H?u,u))dt 
Jr 

> m (min{t, a — t} + — min{t, f3 — t} + )dt 
Jr 

-j mm{t,P-t}(\(H?u,u)-(H?v,v)\ 
+\({H? - H?)v,v)\)dt 

a 2 -/3 2 M ( 3a/3 ( 3a \ 2 \ f3 2 ( 3a \ 

- m ^ — T{ 2 m + {w)) )-^\W)) 

a 2 -(3 2 a 2 ( 6 9 / 3a \ \ 

- m ^ — ^\W) + WY>^\W)))- 

where we have used the fact that a > /3 in the last line. We can therefore write 

D T (au, f) - Dx(/3v, f) > ™ (a 2 (l - Q (-^y) ) - /3 2 ) , (8.48) 



where we have set 



u(5) ■= — (65 + 95 2 + co(3 diam(fi)5) 
m \ 
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The inequality (8.48) shows that dx(-, f) prescribes a w^^yj-near longest edge bisection 

in the euclidean metric for any triangle T. Indeed if the smaller edge (3v was selected, we 
would necessarily have 



\H 2 = /? 2 > (l 



to 



P(T)) 



a 



P(T)) 



era 



Notice that u)(S) -> as 5 -> 0. 

Since / is strictly convex, there does not exists any triangle T C Q such that er(/) p = 
0. Let us assume for contradiction that the diameter of the triangles generated by the 
greedy algorithm does not tend to zero. Then there exists a sequence (7i)i>o of triangles 
such that Tj + i is one of the children of Tj, and hx t — > d > as i — >oo, where hr denotes 
the diameter of a triangle T. Since |Tj| — > 0, this also implies that p(Tj) — > +oo as i — > oo. 

We can therefore choose i large enough such that h\_ < |<i 2 and C2a)^^ryj < | for all 

j > i, where G% is the constant in Proposition 8.4.6 According to this Proposition, we 
have 

3 

v(T i+3 ) < -a(Ti), 



where o stands for cr q in the euclidean case q 



x 



h 2 h 2 

any triangle T, g^j- < cr(T) < ^m, from which it follows that 



y 2 . On the other hand, we have for 



hi < 4 

+ 3 — 



Ti + 3,\a(Ti 



i+3, 



\ T M{ T i, 



h T . < -h 



Ti ■ 



Therefore, hT i+3 < d which is a contradiction. This concludes the proof of Proposition 
18.4.101 



o 



Proof of Theorem 8.4.1 Since / e C 2 , an immediate consequence of Proposition 8.4.10 
is that for all fi > 0, there exists 

such that for all T e 7/Vi, there exists a quadratic function q T such that 

d 2 q T < d 2 f < (1 + n) d 2 q T . 
Therefore our local results apply on all T e Tn x ■ Specifically, we choose 

N 1 :=N 1 (f,c 2 ), 



with C2 the constant in Theorem |8.4.9[ We then take 

\T\ 

T] < Vo ■= min <{ e T (f) P , ' 



TeT Nl 



T v/det 



We use the notations 

fr, = fN, % = T N , N = N(rj) = #(T V ) = #(T N ), 
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for the approximants and triangulation obtained by the greedy algorithm with stop- 
ping criterion given by the local error rj. Note that T v is a refinement of Tn 1) since 
n < min^eT^ er(/) p , and therefore iV > N%. We obviously have 

||/-/jv||xp<^5. 



Using Theorem 8.4.9 we also have 



N= N{T,n) <C,r T \W^{<Pf)\\lr 

and therefore 



(«)' 

T£T Nl 



\\f-f N \\ <C£Wdet(d*f)\\ L . {Q) N-\ 
which is the claimed estimate. Since we have assumed i] < i]q, this estimate holds for 

N>N , 

where N is largest value of N such that er(f) P > f]o for at least one T e 7iv- o 



Remark 8.4.11. In Chapter^ a modification of the algorithm is proposed so that its 
convergence in the LP norm is ensured for any function f G L P (Q) (or f G C(Q) when 
p = oo). However this modification is not needed in the proof of Theorem 
the assumption that f is convex. 



.4-1, due to 
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9.1 Introduction 

We study in this chapter several variants of the greedy algorithm that was discussed 
in Chapter [7] and Chapter [8j 

One of the key features of this algorithm is the decision function which governs the 
creation of anisotropy by selecting among the available directions of refinement of a tri- 
angle. It was proved in Chapter M that for piecewise linear approximation, the L 1 based 



decision function (9.1) leads to optimally adapted triangles. We consider in £9.2 two al- 
ternative decision functions, based on the L 2 approximation error or the L°° interpolation 
error respectively, both in the context of piecewise linear approximation. The study of 
these decision functions is motivated by the following reasons. The L 2 based decision 
function can be computed at a significantly smaller numerical cost than the L 1 based 



385 



386 



Chapter 9. Variants of the greedy bisection algorithm 



or L°° based decision functions, and is therefore the most suited method for numerical 
applications. The L°° based decision function is computationally more costly, but it leads 
to more general convergence results. 

We then focus our attention in §9.3| to the behaviour of the greedy algorithm for 
piecewise linear approximation when applied to cartoon functions. Unfortunately, it turns 
out that the algorithm fails to achieve the best convergence rate (iV -1 in the L 2 norm) 
expected for such functions. This is inherently due to the fact that the geometry of the 
bisections used in the algorithm is too limited, regardless of the decision function which is 
being used. We thus consider some modifications of the greedy algorithm which partially 
solve this problem using alternative bisection choices. Instead of bisecting a triangle from 
a vertex to the midpoint of the opposite edge, we offer different possibilities which lead 
to better directional selectivity. In turn we obtain optimal convergence rates for simple 
cartoon functions of the form f = Xp where P is any half plane. The behavior of the 
greedy algorithm on general cartoon functions remains an open question. 

Finally, we consider in [9A a variant of the greedy algorithm which is in some sense 
much simpler. It is based on piecewise constant approximation, instead of piecewise linear, 
and it produces partitions of the original domain into rectangles aligned with the coordi- 
nate axes, instead of triangles of arbitrary direction. Thanks these simplifications we were 
able to analyze this algorithm in a rather general setting, and we establish in Theorem 



9.4.2 a convergence estimate which applies to any C 1 function and which is in accor- 



dance with the convergence estimate established in Chapter 1 for optimized rectangular 
partitions. 



9.2 Alternative decision functions 

Let us briefly recall the two steps of the greedy refinement algorithm studied in Chapter 
and Chapter [8]: 

a) A triangle T G T which maximizes the approximation error is selected. 

T = argmax e T >(f) p . 
T'eT 

b) For each edge e G {a, b, c} of the triangle T, the bisection of T from the midpoint of e 
to the opposite vertex defines two children T\ and T 2 . The edge e which minimizes a 
given decision function drifi, f) is selected 

e := argmin dr(e', /). 

e'£{a,b,c} 

and we define the new triangulation T 7 by 

r :=T-{T} + {T*X}. 



Theorem 8.4.1 states that if the function / to be approximated is C 2 and strictly 
convex, and if the decision function is given by 



d T (e, f) :=\\f - l T i /||ii(T e i) + 11/ - It| /IUto, 



(9.1) 
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then the sequence of triangulations generated by the greedy algorithm satisfies the optimal 
error estimate 

' (9-2) 



limsupiVe rjv (/) p <C y/\det(d?f)\ 



We now want to investigate two alternate choices for the decision function dx(e,f). 
The first of these choices is based on the L 2 approximation error 



d T (e, f) :=\\f- P n f\\\ HTl) + 11/ - P Ti f\\h 



2 (T|)> 



(9.3) 



where, for any triangle T, we denote by P? the L 2 (T) orthogonal projection onto the 
space Pi of affine functions. The second choice is based on the L°° interpolation error, 



d T (e, f) ■= 11/ - Iti fh-W) + 11/ - It| /||l 



(9.4) 



Our motivation for studying the L 2 based decision function is computational. Indeed, 
from the point of view of computer run time, the biggest cost of the greedy algorithm 
comes from the evaluation of the decision function. As exposed below, a trick allows to 



evaluate the (discretized) L 2 based decision function (9.3) much faster than the L l based 
or L°° based decision functions (9.1) and (9.4). The greedy algorithm has been imple- 



mented in C++ by Lihua Yang. For an image of realistic resolution, say 512 x 512, the 
(discretized) greedy algorithm takes only two seconds on a standard laptop computer to 
generate 5000 triangles using the L 2 based decision function if it is efficiently evaluated. 
In contrast it may take up to a minute to generate the same number of triangles based 
on a brute force evaluation of the L 2 based decision function, or the L 1 based or L°° 



based decision functions. As a result the L 2 based decision function (9.3) is the preferred 
one in numerical experiments. We shall prove that the greedy algorithm based on the L 2 
decision function generates a sequence of triangulations satisfying the optimal estimate 
(9.2) when it is applied to any quadratic function / = q such that the quadratic form q 



is non-degenerate. In particular q may be either stricly convex, concave or of hyperbolic 
type. The latter case, which corresponds to a mixed signature (1, 1) of the quadratic form 
q, is not treated in the study in the previous Chapter M of the L 1 based decision function 



(9.1). 



The decision function based on the L°° error (9.4) is comparable to the L l based deci- 



sion function in terms of computational cost, but it leads to the most complete theoretical 



results. Indeed we shall prove that the optimal convergence estimate (9.2) holds for any 
quadratic function f = q such that the quadratic form q is non degenerate, as well as for 
functions which are C 2 and stricly convex. The latter case was treated in Theorem 8.4.1 



of Chapter [8] for the L 1 based decision function, but is not established for the L 2 based 
decision function. 



Before turning to the detailed study of the different decision functions, we expose as 
announced why the (discretized) L 2 based decision function (9.3) is much less expensive 
to compute than the L 1 based or L°° based decision functions, in terms of computer run 
time. 



388 



Chapter 9. Variants of the greedy bisection algorithm 



Let T be a triangle, and let Mt be the following 3x3 symmetric positive definite 
matrix, the Grammian matrix of the basis (l,x,y) of the subspace Pi of L 2 (T), 



M T := / ( x x 2 xy \ dxdy= / (1, x, y)(l,x, y) l dxdy. 
y xy y 2 





T 



Let also Vr{f) G M 3 be the vector of the first order moments of the function / on T, 

V T (f) := J (l,x,y) T f(x,y)dxdy. 

The L 2 (T) orthogonal projection of / onto Pi has the expression 

P T (f) = (l,x,y)M- 1 V T . 
Therefore, denoting by (-, •) the L 2 (T) scalar product, 

\\f ~ PtJWUt) = \\f\\h(T)-2(f,PT(f)) + \\PT(f)\\l, iT) 

i 2 (T) ~ ll-Pr(/)|li2 (T) 



f(x,y) 2 dxdy - V T (f) T M^V T (f). 

Jt 

Hence the L 2 (T) approximation error of / on T has an expression in terms of the integrals 
on T of the functions 1, x, y, x 2 , xy, y 2 , and /, xf, yf, f 2 . The Fubini formula transforms 
an integration on a bidimensional domain into two successive one dimensional integrations. 
Indeed for any triangle T and any g G L 1 (T), 



g(x,y)dxdy = / g(x,y)dx } dy 

y*(T) . 

(G(x*(T,y),y)-G(x*(T,y),y))dy. 
y*(T) v 



(9.5) 



Where we have used the notations 



G(x,y) := / g(x,y)dy 



y*(T) := min{y 6l; 3a; G R such that (x, y) G T}, 
x*(T,y) := min{x G R ; (x,y) G T}, 

and where y*(T) and x*(T,y) are obtained by taking the maximum instead of the mini- 



mum in the definitions of y*(T) and x*(T,y) respectively. The important point in (9.5) 



is that the function G does not depend on the triangle T. If this function is known, then 



(9.5) transforms the bidimensional integration of g into a one dimensional integration. 

This strategy extends to the discrete setting with the Lebesgue measure on T replaced 
by the counting measure at pixels where the approximated function / is sampled and which 
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have their center contained in T. The integrals of the type j T g(x,y)dxdy appearing in 
the previous equations are therefore replaced by discrete sums 



E 

(*j)6T 



9\h3, 



(9.6) 



where (i, j) stands for the center of the pixel, and without loss of generality runs over 
{0, • • • ,N — l} 2 for a. N x N image. In particular the quantity ||/ — PrfW^ appearing in 
the definition (9.3) of the L 2 based decision function is replaced with 



J) - Hhjj 



(9.7) 



Again, this quantity can be expressed in terms of the (discrete) integrals of the functions 



1, x, y, ■■■ ,xf,yf,f 2 , which do not depend on T. The Fubini formula (9.5) is replaced 
with 

E 9(i,j)= J2 G iHT,j)d ~ G u(T,j),j (9-8) 

(M)6T j*(T)<j<j*(T) 

where 



G 



'■j 



3*{T) 



0<i'<i 



and similarly j*(T) = [y*(T)\ and i*(T,j) = [x*(T,j)\. The points [i m (T, j), j), for 



all j*(T) < j < j*(T), are illustrated on the right of Figure 9.1 and surrounded by 
grayed squares. On the same figure the points (i*(T,j), j), for all j*(T) < j < j*(T), are 
surrounded by white squares. 

Consider a fixed function g, such as 1, x, y, x 2 , xy, y 2 or /, xf, yf, f 2 which are needed 



for the evaluation of (9.7), for which the sum (9.6) needs to be evaluated for a large 
number of distinct triangles. The matrix {Gij), 0<i<N,l<j<Nis evaluated 



once, and for any triangle T the sum (9.6) is computed using (9.5) which involves a much 



smaller collection of points, see Figure S^L This method greatly accelerates the numerical 
implementation of the greedy algorithm, especially for the large and isotropic triangles 
that occur in its first steps. 



9.2.1 Positive quadratic functions 



In this section, we study the algorithm when applied to a quadratic polynomial q such 
that det(q) > 0. We shall assume without loss of generality that q is positive definite, 
since all our results extend in a trivial manner to the negative definite case. 

We first establish that the refinement procedure, using either the L 2 based decision 
function (9.3) or the L c 



sense of the q-metric 



u\ 



based (9.4), always selects for bisection the longest edge in the 
j := \J q(w) for u G M 2 , as was already the case for the L 1 based 
decision function (9.1). This is used to prove that the refinement procedure produces 
triangles which tend to adopt an optimal aspect ratio. 
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Figure 9.1 - The cost of evaluating the interpolation error of a function / on a triangle 
T in the L 1 or L°° norm is proportional to the number of points with integer coordinates 
on T (left). Fewer points need to be considered to compute the L 2 projection error (right). 



The L°°-based split 

Let us denote by 



oir(f) ■= 11/ - ir/IU«(r), 



the interpolation error in the sup norm. The decision function (9.4) can be re-expressed 

as 

d T (eJ) = a n (f) + a T 2(f). (9.9) 

Theorem 9.2.1. If |a| q > max{|6| q , |c| q } ; then dx^a^q) < mm{dT(b,q),dT(c,q)}. The- 
refore the refinement procedure based on (9.9) selects the longest edge in the sense of the 
(\-metric. 

In order to prove this result, we need to study the interpolation error in detail. 

Proposition 9.2.2. Let T be a triangle with edges a,b,c such that |a| q > |6| q > |c| q; and 
let w G IR 2 and r > be the center and radius of the circumscribed circle for the q-metric, 
i.e. such that \v — u>| q = r for all the vertices v of T . Then 

U2 

< ar{q) < r 2 . 



4 



Right equality holds if T is acute, i.e. (Qb,c) < 0, where Q is the symmetric matrix 
associated to the quadratic form q. Left equality holds ifT is obtuse, i.e. (Qb,c) > 0. 

Proof: At any point u £ 1R 2 , we have 



(q — lTq){u) — \u — if| 2 — r 2 . 

Indeed, the difference (q — lTq)(u) — (\u — w| 2 — r 2 ) is an affine function of u, which 
vanishes at the three vertices of T. Hence this difference is zero. 

The function \u — w| 2 — r 2 is negative on T with maximal value at the vertices. If T 
is acute, then its minimal value on T is — r 2 and is attained at w G T. If T is not acute, 
then the minimum is attained at m a , the midpoint of a, and if we choose a vertex v at 
one end of a, we obtain the value at the minimum by Pythagoras' identity which gives 

(q - I T q){m a ) = \m a - w| 2 - r 2 = \m a - w\\ - \v - w| 2 
= -\v-m a \ 2 = -|a| 2 /4. 
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o 



The dichotomy in the above result is illustrated in the case of the euclidean metric on 
figure 4. Note that it would be sufficient to establish the above proof in this particular case, 
since we can perform an affine coordinate change = Q~? such that qo0 is the standard 
euclidean form and that the L°° interpolation error is left invariant by this coordinate 
change. 





Figure 4 : maximum point for the L°° interpolation error 



We now prove the following result which clearly implies Theorem 9.2.1 



Proposition 9.2.3. Let T be a triangle with edges |a| q > |6| q > |c| q . We then have 
d T (b,q)-d T (a,q) > -(q(a) - q(6)) 
d T (c,q)-d T (a,q) > \ ( q(a) - fi*k±M 




Figure 5 : Notations in the proof of Proposition (9.2.3) 



Proof: We introduce sub-triangles Tl 



1, 2 and e = a,b, c, as defined in Figure 5, 



which correspond to the three refinement scenarios. With such definitions, the following 



inequalities are easily derived from Proposition (9.2.2) 



4a T |(g) = q(6) (since is obtuse) 

4a T i(g) > q(a) 

Aa T i(q) > q(a) 

Aa T 4q) > q(6) 
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On the other hand, we shall prove 



a T i(q) < a T 2(q), 



(9.10) 



and 



4a T i(g) 



< 



(9.11; 



The proof of (9.10) follows from elementary geometric observations. Let L be a line which 



is parallel to c but does not contain it, and for x £ L denote by T x the triangle of vertices 
x and the end points of c. Denoting respectively by u(x) and v(x) the diameter of T x 
and of its circumscribed circle for the q-metric, we remark that these functions decrease 
monotonously as x tends to a point x c which is the orthogonal projection (also for the q- 
metric) of the mid-point of c onto L. Since the function x t— > otT x {q) is continuous in x and 
equal to u(x) or v(x) at all x, we conclude that this function also decreases monotonously 
as x tends to x c . Applying this observation to the line that contains m a and m;, the mid- 



points of a and b, and remarking that m a is closer to x c than mj, we conclude that (9.10) 
holds. 



From (9.10 ) and the first set of inequalities, we obtain the first statement of the theorem 
since 

d T {b, q) - d T (a, q) = a T i (q) + a T 2 (q) - a T 2 (q) - a T i (q) 
> a T i(q) - a T 2(q) > |(q(a) - q(6)). 



The proof of (9.11) also follows from elementary geometric observations. In the case 
where is obtuse, one of its edges e is such that 4a T i = q(e), and (9.11) follows since 
l e lq < ^(Nq + I c lq) f° r & H e ; using triangle inequality. Let R be an acute triangle of vertices 
u, v, w and let m = (u + v)/2. Remarking than the center of the (euclidean) circumscribed 
circle to R lies inside R, one easily checks by convexity that the (euclidean) diameter of 
this circle is smaller than |m — u\ + |m — w\. In the case where is acute, the diameter of 
its circumscribed circle is thus bounded by |(|6| q + |c| q ), as illustrated on Figure 6 when 
q is the euclidean metric. 



From (9.11) and the first set of inequalities, we obtain the second statement of the 



theorem since 



d T (c, q) - d T (a, q) = a T 2 (q) + a T i (q) - a T 2 (q) - a T i (q) 



> 



q(6) + q(o) - q(6) 



[Ma±Nq 

2 



q(aj 



2 



The L 2 -based split 

We now denote by 



M) ■= \\f - PrfWiHT), 
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the orthogonal projection error in the L 2 norm. The decision function (9.3) now writes 

d T (eJ) = Mf) 2 + Mf) 2 . 



(9.12) 



We shall prove that the refinement procedure based on (9.12) behaves in a similar way as 
@. 

Theorem 9.2.4. If d,e G {a,b,c} are two edges such that |d| q < |e| q; then dx{e,q) < 
dT{d,q). Therefore the refinement procedure based on (9.12) selects the longest edge in the 
sense of \ • | q . 

In order to prove this result, we first provide with an algebraic expression of Pt{q) 
which is valid for any quadratic function q. 

Proposition 9.2.5. Let T be a triangle with edges a,b,c and area \T\, and let q be a 
quadratic function. Then 



P 2 (q) = \T\ ( Cl (q(a) + q(6) + q(c)) 2 - c 2 det(q)|T| 2 ) . 



(9.13) 



with constants c\ = and c 2 = Ciy 



4 

225 ' 



Proof: We first prove (9.13) on the triangle R of vertices {(0,0), (0, 1), (1,0)}. It is easy 
to compute the integrals on R of monomials x k y l , k + I < 4. Using these quantities, we 
can derive the orthogonal projection of a quadratic function thanks to a formal computing 
program, which gives us 



q = ux 2 + v y 2 + 2wxy 



„ u + v + w 2x .„ , 2w.„ . 

PrCL = + —(2u + w) + -f(2v + w). 

10 5 5 



This yields the following expression for the L 2 -squared error between q and its projection 



7w r " 



uw — 2vw + 



which is equivalent to (9.13). 

For an arbitrary triangle T, using an affine bijective transformation <fi from R to T, 
we have 
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where q = q o and is the constant jacobian of 0. Using the validity of (9.13) on R 
and the fact that ITI = JAR\, we thus obtain 



P T (qY = |T| (c!(q(a) + q(6) + q(5)) 2 - c 2 det(q)|i? 

where q is the quadratic form associated to g and e denotes the edge segment of R mapped 
onto e by 0. Since q(e) = q(e) and det(q) = J? det(q), we obtain (9.13) for T. o 



We now prove the following result which clearly implies Theorem 9.2.4 



Corollary 9.2.6. Let T be a triangle with edges a,b,c and area \T\, with |a| q > |6| q and 
lain > IcL. Then 



d T (b,q)-d T (a,q) > -^T^af - q(6) 2 ), 

5 

d T {c,q) - d T {a,q) > -Ci|T|(q(a) 2 - q(c) 2 ). 



(9.14) 
(9.15) 



Proof: The children triangles all have area \T\/2, and take their edges among a,b,c, 
a/2, 6/2, c/2 and £=« (recall that a + 6 + c = 0). We use the identity 

q(w + u) + q(w - f ) = 2q(u) + 2q(w), 

which is valid for all quadratic forms, and implies 



q 



b — c 



q(6) + q(c) q(o) 



Using (9.13), this allows us to compute the local projection errors for the children of T. 
For example bisecting the edge a creates two children T' and T" with edges |, b, ^ and 
§, c, and therefore 



l^| ci q 



q(6) + q 



c-6 



c 2 det(q)|T' 



7 1 2 



and similarly 



|T| 



/3^(g) 



Cl 



|T| 



3q(6) + q(c) 



c 2 det(q) 



ITI 



Cl 



3q(c) + q(6) 



- c 2 det(q) 



ITI 



Adding up, we thus obtain 
IT 

d T (a,q) = — 



( Cl (3q(6) + q(c)) 2 + Cl (q(6) + 3q(c)) 2 - 2c 2 det(q)|T| 2 ) . 

Subtracting this from the analogous expression for drib, q) we obtain 

CilTl 



dr(b,q) - d T (a,q) 



(5 (q(a) 2 -q(6) 2 )+6q(c)(q(a)-q(6))) 



(9.16) 



which implies ( 9.14| ). Exchanging b with c we obtain (9.15) which concludes the proof. 
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Convergence towards to optimal aspect ratio 



We have established in Theorems 9.2.1 and 9.2.4 that for any triangle T and any 
quadratic function g 6 P 2 such that the homogeneous component q G H 2 is positive 
definite, the decision functions (9.3) and (9.4) based on the L 2 or L°° error lead to the 



bisection of the same edge of T : the longest edge of the q-metric. 



Similarly it is established in Lemma 8.3.1 in the previous chapter that the decision 



function (9.1 ) based on the L l interpolation error leads to the bisection of the same edge. 



As a result, the decision functions can be interchanged without changing the result of 



Corollary 8.3.10 which extends as follows. 



Theorem 9.2.7. Let Q be a triangle, and let q be a quadratic function with positive 
definite associated quadratic form q. Let q^ be the approximant of q on Q obtained by the 
greedy algorithm for the LP metric, using the L 1 , L 2 or L°° decision function (9.1), (9.5), 
Then 

limsupiV||g - q N \\ LP(Q) < C||y / det(q 



(94) 



7V->oo 



(9.17) 



where ~ = - + 1 and where the constant C depends only on on the choice of the approxi- 
mation operator At used in the definition of the approximant. 



The local perturbation analysis exposed in the previous chapter, £ 8.4.2 for the L 1 
based decision function extends without difficulties to the L 2 and L°° based decision 



functions. From this point, it is not difficult to extend Theorem 9.2.7 to functions / which 
are close enough to a positive definite quadratic function q, in the sense that 



d 2 q < d 2 f < (l+fx)d 2 q 



(9.18) 



for a constant \x > small enough. 

Let us consider a C 2 and strictly convex function / defined on a polygonal domain 
Q. In order to establish an asymptotical error estimate, we need to be ensured that 
after sufficiently many steps of the greedy algorithm, the target function / can be well 
approximated by a quadratic function q = qx on each triangle T, in the sense of (9.18), 



so that our local results will apply on such triangles. This property follows from the next 



proposition, which is an extension of Proposition 8.4.10 in which the L l based decision 
function (19.11) is replaced with the L°° based decision function. We postpone its proof to 



the appendix §9.5.2| 

Proposition 9.2.8. Let f be a C 2 function such that d 2 f(x) > ml for some arbitrary 
but fixed m > independent of x. Let Tn be the triangulation generated by the greedy 
algorithm applied to f using the L°° decision function given by (9.4). Then 



lim max diam(T) = 0, 



i.e. the diameter of all triangles tends to 0. 



As a consequence, we may extend Theorem 8.4.1 to the L°° based decision function. 
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Theorem 9.2.9. Let Q be a polygonal domain and let f G C 2 (Q) be such that d 2 f(x) > 
ml for all x G Vt, for some arbitrary but fixed m > independent of x. Let f^ be the 
approximant obtained by the greedy algorithm for the LP metric, using the L l decision 



function (9.1) or the L°° decision function (9.4)- Then 



limsupiV||/-/ JV |U P <C||Vdet(^/)|| LT) 



where 1 = i + 1 and where C is a constant independent of p, f and m. 



9.2.2 Quadratic functions of mixed sign 

In this section, we study the algorithm when applied to a quadratic polynomial q such 
that det(q) < 0. We shall follow the same steps, and reach similar conclusions, as in the 
positive definite case, using a measure of non- degeneracy which is equivalent to p q (T). If 
a triangle T has edges a,b,c such that |q(a)| > |q(6)| > |q(c)|, we will still refer to a as 
the "longest" edge in the sense of q, although q does not define a proper metric anymore. 
Recall that 

|q(a)| 



PqCO := 



iTlVldetqf 



The following inequalities that will be repeatedly used in this section can be derived 
when p q (T) is large enough. We postpone their proof to the appendix. 

Proposition 9.2.10. Let T be a triangle such that |q(a)| > |q(6)| > |q(c)| ; and define 
d= b -^. 

If PqiT) > 4, then q(a)q(6) > 0, |q(a)| > 4|q(c)| and |q(o)| > 4|q(d)|. (9.19) 



Ifp^T) > 8, then |q(a)| < ||q(6) + q(c)|. (9.20) 



The L°°-based split 



Theorem 9.2.11. The refinement procedure based on (9.9) selects the longest edge in the 



sense o/q : if |q(o)| > max{|q(6)|, |q(c)|} and p q (T) > 4, then 

d T (a, q) < min{d T (6, q), d T (c, q)}. 

This theorem is very similar to the one for positive quadratic functions. In order to 
prove it, we first study the interpolation error which has a simple form in this context. 

Proposition 9.2.12. Let T be a triangle with edges a,b,c. Then 

a T (q) = -max{|q(o)|, |q(6)|, |q(c)|}. 
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Proof: Let xq be the point of T at which the interpolation error is attained : xq = 
argmax T \q — Itq\- If %o is i n the interior of T, then it must be a local extremum of 
q — ItQ- However this function has only one critical point on M 2 , which is not an extre- 
mum since q has mixed signature. Therefore Xq must lie on an edge. On each edge of T, 
the function q — l^q is a one dimensional quadratic function vanishing at the endpoints. 
It follows that xq must lie in the middle of an edge and the result follows. o 



Proposition 9.2.13. Let T be a triangle with edges |q(a)| > |q(^)| > |q( c )l an d such 
that p q (T) > 4. Then 

|qQ)l - |q 

8 

|q(o)l 



dr(b, q) - d T (a, q) > 
d T (c, q) - d T (a, q) > 

o 

Proof: The bisection through the edge a creates two sub-triangles T^, T% of edges respec- 
tively d and |,c, d. Using the last two inequalities in (9.19) we obtain that 4a T i = 
max{|q(a/2)|, |q(fe)|} and 4o;t2 = |q(a/2)|. Therefore 

a r / \ |q( a )l f |q( a )l i /,m 

4d T (a } q) = ^^+max|^^,|q(6)| 

On the other hand, the choice of bisecting the edge b creates two subtriangles respectively 
containing the edges a and |, and the choice of bisecting the edge c creates two subtriangles 
respectively containing the edges a and b. This provides us with the lower bounds 

4d T (M) > |q(a)| + i^l, 
M T (c,q) > |q(a)| + |q(6)|. 

The proposition follows easily, distinguishing between the two cases |q(a)| < 4|q(6)| and 
|q(a)| >4|q(6)|. o 



The L 2 -based split 



The same conclusions can be reached for the refinement procedure based on (9.12). 



Theorem 9.2.14. The refinement procedure based on (9.12) selects the longest edge 
in the sense of q : if |q(a)| > max{|q(6)|, |q(c)|} and p q (T) > 4, then dr(a,q) < 
mm{d T (b,q),d T (c,q)}. 



Proof: The expression found in (9.16) remains valid when det(q) < 0. Substituting a by 
b or c and subtracting, we obtain 



d T (b,q) - d T (a,q) 
dr(c,q) - d T (a,q) 



fm( q( a)- qW )( S+ <f 
^|Tl(q(.)-q(c))f. + SW 
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where s = q(a) + q(6) + q(c). Using (9.19), we see that the quantities 



q(b) q(c) , . ... , , . , , 

, s+^i q(a)-q(o) and q(a) - q(c) 
5 5 

all have the same sign as q(a) and are non-zero. It follows that 

d T (a,q) < mm{d T (b,q),d T (c,q)} 
which concludes the proof. o 

Convergence toward the optimal aspect ratio. 

We have proved that the refinement procedure - either based on the L°° or L 2 decision 
function - systematically picks the longest edge in the sense of q if Pq(T) > 4. Similarly 
to the positive definite case, we now study the iteration of several refinement steps and 
show that the generated triangles tend to adopt an optimal "aspect ratio" in the sense 



of the measure of non-degeneracy p q (T) introduced in £8.2 If T is a triangle with edges 
a, b, c, let us recall that 

„ ( r p\ _ max{|q(q)|, |q(6)|, |q(c)|} 



As in £8.3, we introduce a close variant to p q (T). If T is a triangle with edges a,b,c, 
we define 

(T) . = min{|q(a) + q(6)|, |q(6) + q(c)|, |q(c) + q(o)|} 

4|T|v^det^J 



Note that if q was a positive quadratic form, this definition is consistent with (8.22). We 
define our measure of non-degeneracy /t q by 

«q(r) := max{a q (T), 5/2}. (9.22) 

We first show that the quantities K q and p q are equivalent. 

Proposition 9.2.15. For any triangle T , one has 

2a q (T) < Pq (T), (9.23) 

and 

4 32 

gKqCO < Pq (T) < yK q (T). (9.24) 

Proof: We denote by a,b,c the edges of T, and we assume that |q(a)| > |q(6)| > |q(c)|. 



The inequality (9.23) follows directly from the triangle inequality 



2|T| v ^det r ^r q (T) < lq(&) ^ q(c)l < |q(a)| < |T| v^deI^p q (T). 
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As mentioned earlier, p q (T) is always larger than 2 and therefore (9.23) implies the left 



inequality in (9.24). 



It remains to prove the right inequality in (9.24). If p q (T) < 8, it is immediate since 
« q (T) > | and f | > 8. If p q (T) > 8 then q(o)q(6) > according to (|9 19[) and therefore 



|q(6) + q(c)| < |q(a) + q(c)| < |q(a) + q(6)|. 
We obtain using (9.20) that 

8|q(6) + q(c)| 



IQWI < » |qW + q^| = 32 < 32^ 
T|^de^J ~ 3 |r|v^tq| 3 ^ ' ~ 3 ql h 



which concludes the proof. 



o 



Similar to p q , the quantity K q is invariant by a linear coordinate changes 0, in the 
sense that 

Kqo*(T) = «q(0(T)). 

Our next result shows that K q (T) is always reduced by the refinement procedure. 

Proposition 9.2.16. IfT is a triangle with childrenTi andT 2 obtained by the refinement 
procedure for the quadratic function q, using the L 2 based or L°° based decision function, 
then 

max{K q (Ti), K q (T 2 )} < « q (T). 
Proof: Let us assume that a is the longest edge in the sense of q. In the case where 



Pq(T) > 4, we already noticed in the proof of Proposition 9.2.15 that 

|q(6) + q(c)| 



4|T|v^det^J 



Moreover, according Theorems 9.2.11 and 9.2.14 the edge a is selected by both decision 
functions. It follows that the children Tj have edges a/2, 6, (c — 6)/2 and a/2, {b — c)/2, c 
(recall that a + b + c = 0). We thus have 



2|7V|detq| cr q (T^ 



< 



c n 2 ' + q 



q 



6 + c 



q 



|q(6) + q(c) 



= 2|T|V|detq| a q (T). 

We have proved that cr q (Tj) < cr q (T), and it readily follows that « q (Tj) < Kq(T). 

In the case where p q (T) < 4, we remark that Tj contains at least one edge from T, say 
s G {a, 6, c} and one half-edge t E {|, |, |}. This provides an upper bound for (j q : 



|q(g) + q(t)| |q(o)| + |q(| 



5 /mx 5 

, . , - -Pq(T) < 

2|r|v^deTq7" 2|r|v^deTq7 8 rqV ; "2 



(9.25) 
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Therefore K q (Tj) = | < K q (T). 

Our next objective is to show that as we iterate the refinement process, the value of 
K q (T) becomes bounded independently of q for almost all generated triangles. If T is a 
triangle such that |q(a)| > |q(fe)| > |q(c)| and if the edge a is cut (which is the case as 
soon as p q (T) > 4, using the L 2 or L°° based decision functions) we define ipq(T) as the 
subtriangle containing the edge c. We first prove a result which is analogous to Proposition 

ion 

Proposition 9.2.17. If T is a triangle such that K q (-0 q (T)) > |, then /t q (^ q (T)) < 
§« q (T). 



Proof: Let S be a triangle such that Kq(i[>q(S)) > |. According to (9.25), one must have 
p q (S') > 4. We assume that the edges of S satisfy |q (q)| > |q(fe)| > |q(c)|. Since the three 
edges of i/jq(S) are |, c and d = it follows from (9.19) that the longest edge of if)q(S) 
in the sense of q is |. 
Since 

5 



« q (V q (T)) > ^\{T)) > « q (^(T)) > 



2' 



we can apply this observation to the triangles T, ipq(T) and ^(T). Therefore, denoting by 
a the longest edge of T in the sense of q, we find that | is the longest edge of ip^(T). Since 
|^ q (T)| = |T|/8, we obtain that pq(ip^(T)) = p q (T)/8. Using the results of Proposition 



9.2. 15[ we thus have 



which concludes the proof. 



^P q (T) < \Kq{T), 



An immediate consequence of Propositions 9.2.16 and 9.2.17 is the following. 



Corollary 9.2.18. If (T^)? =1 are the 8 children obtained from 3 successive refinement 
procedures from T for the function q, then 
- for alli,Kq{Ti) < Kq{T), 



there exists i such that , /c q (Tj) < |ft q (T) or K q (Tj) 



5 

2 ' 



We eventually obtain the two following results which proof are exactly similar to the ones 



of Theorem 8.3.9 and Corollary 8.3.10 



Theorem 9.2.19. Let T be a triangle, and let q a quadratic function of mixed type. Let 



k 



ln(2« q (T)/5) 
In3-ln2 



Then after n applications of the refinement procedure starting from T , at 



most Cn k 7 n / 3 of the 2 n generated triangles are such that Kq(S) > | where C is an absolute 
constant. Therefore the proportion of such triangles tends exponentially to as n — > +oo. 
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Theorem 9.2.20. Let Vt be a triangle, and let q be a quadratic function such that det q < 
0. Let qjsj be the approximant of q on Q obtained by the greedy algorithm for the L p metric, 
using the L 2 or L°° decision function (9.3), (9.4). Then 



limsup N\\q - q N \\Lp(n) < C\ 

N^oo 



/det(q) 



\Lt(SI), 



where ~ = - + 1 and where the constant C depends only on on the choice of the approxi- 
mation operator At used in the definition of the approximant. 



9.3 Alternative bisection choices, and the approxi- 
mation of cartoon functions 

A popular, although simplistic, model for images is the set of cartoon functions, see 
Definition |4.3.1 In summary a function / defined on a domain Q is a cartoon function 



if / is C 2 except along a finite collection of C 2 curves where / may have discontinuities. 
For any cartoon function / on the bi-dimensional domain Q — [0, l] 2 , the approximations 
(/at)jv>o obtained by retaining the iV largest coefficients in the wavelet expansion of / 



satisfy, as observed in (4.3) 



Wf-fN\\v>(n)<CN-*. (9.26) 

One of the purposes of anisotropic mesh adaptation is to improve on this estimate, and 
we therefore focus in this section on the L 2 approximation error. For any triangle T and 
any function / G L 2 (T), we define 

er{f) ■= inf ||/ - vr|| i2(T) = \\f - Mf)\\i?(T), (9-27) 

7rtIPi 

where Pt is the operator of L 2 (T) orthogonal projection onto the space Pi of affine 
functions. For any triangulation T of a polygonal domain Q, and any / e L 2 (Q), we 
denote by e-r(f) the error of approximation of / in the L 2 (Q) norm by discontinuous 
piecewise affine functions on 7 ' . This quantity is defined by 

erUf ■= E e ^ =\\f~ Mf)\\h { n): (9-28) 
TeT 

where Pr is the L 2 (Q) orthogonal projection onto the space of discontinuous piecewise 



affine functions on the triangulation 7. The heuristic analysis led in (4.15) suggests that, 
if / is a cartoon function on a polygonal domain fl, then there exists a sequence (7jv) 7v>tv 
of anisotropic triangulations of Q, satisfying #(7jv) < N, and such that 

er N {f)<CN-\ (9.29) 

We make in this section an attempt, not totally conclusive, to answer the following 



question : is it possible to produce a sequence (7n)n>n °f triangulations satisfying (9.29) 



using a hierarchical refinement algorithm? The analysis presented in £9.3.1 shows that 
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Figure 9.2 - (left) Three bisection choices S T , (right) Six bisection choices S T . 



unfortunately the refinement algorithm studied in the previous chapters does not satisfy 



(9.29). This failure is inherently due to the limited geometric selectivity of the algorithm 



due to the choice of bisections. For this reason, we study other types of bisections in 



j;9.3.2|and £9.3.3, also based on splitting a selected element (which may not anymore be a 



triangle) by a line cut. In this more general setting, the greedy algorithm has the following 
generic form : 

1. An element T which maximizes the approximation error is selected within the par- 
tition T ' . 

T = argmax ej"(/). 
T'eT 

2. The refinement procedure considers a finite set St of segments s along which the 
element T may be split into two, thus creating two children elements and T 2 . 
The segment s 6 St which minimizes a given decision function g?t(s, /) is selected 
and we define 

r :=T-{T} + {T'X}- 

Two possible choices for the set St, denoted by S T and S T , are illustrated on Figure 
9.2 The set S T contains three segments, which join one of the three vertices of T to the 



midpoint of the opposite edge. It corresponds to the three bisection choices studied in 
the previous chapters. The set S T contains a larger choice, namely six segments joining a 
vertex v 1 of T to the barycenter \v 2 + If 3 of the other vertices v 2 and v 3 . Other bisection 



choices, that lead to non-triangular partitions, are displayed on Figure 9.9 



We typically consider the L 2 based decision function : if an element T is bisected along 



a segment s, creating two smaller elements T} and T 2 , then 

d T (sJ) :=e n (f) 2 + e T? (f) 2 . 



(9.30) 



Starting from a triangulation T/v of the polygonal domain Q on which the function / 
needs to be approximated, with #(7at ) = N , the greedy algorithm creates step after step 
a sequence T/v 0+ i, T/v 0+ 2, ■ ■ • of triangulations of Q satisfying #(7jv) — N. At the time of 
writing, unfortunately, the author does not know how to choose the set of segments SV 
and the decision function oIt{s, f) such that given any cartoon function /, the sequence 
(Tn)n>n of partitions produced by the greedy algorithm satisfies the desired estimate 



(9.29). 



As an intermediate objective, we choose to study the behavior of the refinement proce- 
dure when / is a particularly simple cartoon function, the characteristic function f = Xp 
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when P is a half plane. Up to translation and rotation, we may always assume that P is 
of the form 

P:={(x,y)eR 2 ; y>0}. 

We denote by D the line 

D = dP = {(x, 0) ; x E K}, (9.31) 
and we observe that, denoting by T the interior of a triangle T, one has 

e T (x P ) = if and only if f D D = 0. (9.32) 

We shall first prove that, for certain configurations between the line D and the initial 
triangulation To, the refinement procedure based on the three bisection choices Sj> pro- 
duces a sequence of triangulations (Tn)n>i that does not satisfy the optimal convergence 
estimate 

e TN (Xp) < CN-K (9.33) 



We then show that if the six bisection choices Sj. illustrated on Figure (9.2) are allowed 



and if an appropriate decision function is used, then (9.33) is satisfied. Eventually, we 



consider another variant of the refinement procedure, which only involves three bisection 



choices at each step, and for which (9.33) is again satisfied. 



9.3.1 Three bisection choices : a negative result 

We show in this section that the greedy algorithm studied in the previous chapters 



beats the error estimate (9.26) associated to wavelet expansions when / = xp- However 



it fails to achieve the desired error estimate (9.33). More precisely, Proposition 9.3.4 
shows that, if the decision function dr(s,f) satisfies Assumptions 9.3.1, then the greedy 
refinement procedure for the function f = XPi starting from an arbitrary triangulation 
Tn , #(7iVo) = ^o, produces a sequence (Tn)n>n of triangulations, #(7jv) = N, such 
that 

e TN (Xp) < CN- X ' 2 , (9.34) 
where A/2 = 0.732 • • • . This estimate cannot be improved since we establish in Proposition 



9.3.2| that there exists a triangle T re f such that the sequence (Tn)n>i produced by the 
greedy refinement procedure for the function f — Xp starting from T\ = {T re f} satisfies 

er N (Xp) > cN- x > 2 (9.35) 

where c > 0. 

We say that two triangles T, T' are D-affme equivalent if there exists an affine change 
of coordinates 6 : 1R 2 — > 1R 2 , that transforms T into T' and leaves the line D defined at 



(9.31) invariant : 

0(T) = T' and 0(D) = D. 
If T and T' are D-affine equivalent, then it follows from a change of variables that 

\T\-h T { XP ) = \T'\'h T '{xp)- (9.36) 



The following assumptions on the decision function are illustrated on Figure (9.3). 
Their purpose is to create, through the refinement process, a thin layer of triangles along 
the line D. 
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Figure 9.3 - Illustrations of Assumptions 9.3.1, a) left and b) right. The line D (Thick), 
forbidden bisection choices (dashed), authorized bisection choices (full). 



Assumptions 9.3.1. Let T be a triangle such that ct{xp) ^ 0. In the following we 
work under the assumption that segment s 6 which minimizes the decision function 
dr(s,xp) satisfies the following properties. 

a) If there is such a possibility in S^, the bisection of T along s creates a child T' such 
that eT'ixp) = 0- 

b) If a) is impossible then the decision function selects a segment s e 5| joining a vertex 
of the triangle T to the midpoint of an edge which is intersected by the line D. 



These two properties are illustrated in figure |9.3| Numerical experiments strongly 
suggest that they are satisfied for the decision function based on the L 2 error which is 



defined by (9.30) 



We first prove the lower error bound (9.35), and for that purpose we denote by T rc f 
the triangle of vertices 

(0,-1), (3,-1), (0,2). 



T re f is the large triangle enclosing the others on Figure 9^4 . The refinement procedure 
applied to the triangle T ref and the function f = Xp creates an infinite master tree P re f of 
triangles. If V is a finite subtree of V Te f, (in other words V contains T ref and the nodes of 
V have either two or no children), then the leaves of V define a triangulation T of T re f. 

Proposition 9.3.2. There exists a constant c > such that : for any triangulation T 
associated to a finite subtree ofV TC {, we have 

erixp) > c#(7T A/2 (9.37) 

where c> is a positive constant and X = 1.464 ■ ■ ■ is the solution of 

i i 
1 \ 1+A / 1 \ 1+A 



(9.38) 



In particular let {Tn)n>\ be the sequence of triangulations generated by the greedy al- 
gorithm applied to the function f — Xp an d starting from T\ = {T rcf }. If the decision 
function satisfies Assumptions 9.3. 1\ then for all N > 1 one has 



(Xp) > cN- x l\ 
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Figure 9.4 - Subdivision of the triangle T re f , some of its children and grand-children, with 



a decision function satisfying Assumptions 9.3.1 The colored triangles satisfy ct(xp) — 0. 




A 




Figure 9.5 - Tree associated to the subdivision illustrated on Figure 9.4 (left), and 
equivalence classes of the elements of this tree for the relation of D-affine equivalence 
(right). 



Proof: Figures 9.4 and 9.5 reveal a surprising property of the master tree V Te f- After a 



few steps, as illustrated on the right of Figure 9.4 we obtain two (white) triangles which 
are D-affine equivalent to the initial triangle T re f, and three (colored) triangles on which 
the approximation error ct{xp) is zero. It follows that, up to the relation of .D-affine 
equivalence, the tree P re f is self-similar and is a repetition of the pattern presented on 



the right of Figure 9.5 



We define an auxiliary tree V rei as follows. The nodes of V' Te{ are the triangles T £ V Te f 
which are affine equivalent to T rc f with respect to D. The children T 1; T 2 of a triangle T 
in V[ ei are the grand-child and grand-grand-child of T in V re { which are affine equivalent 
to T re f. Note that 

| T 1 = 4|Ti| = 8|T 2 |. (9.39) 

Let T be a triangulation of T re f associated to a finite subtree V of V^i- We denote by V 
the smallest subtree of V re{ such that any leaf of V is contained in a leaf of V . We denote 
by T' the collection of leaves of V . For instance, if T is any of the three triangulations 



illustrated on Figure 9.4, then T' consists of the two white triangles in the right of Figure 



9.4 A triangle T £ T contains a single triangle V £ T' if ct{xp) 0, and none otherwise. 

(9.40) 



Therefore 



#(T') < #(71. 

Furthermore, since all the elements of T' are D-affine equivalent to T re f, we have 
er(Xp) 2 = e T(Xp) 2 > e T(Xp) 2 = c Y l T l 



(9.41) 



TeT 



TeT' 



TeT' 
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where Co = er Ici (xp) 2 /|^ref |- For any triangle T e V', we denote by £(T ) the collection 
of the elements of T' that it contains, 



and we define 



f (T ) := {T e T ; T C T }, 
i(T ):= ]T |T| and n(T ) := #(£(T )). 

Te£(T ) 



We establish below that for any T G the following inequality holds 

|T | < a(T )n(T ) A . 



(9.42) 



Once this property is established, then choosing To = T re f and combining this inequality 
with jpQ) and g4l] ) yields 

e r (Xp) 2 > c Y^ \ T \ 
TeT' 
= c a(T rcf ) 

> c |T ref | n(T rcf )~ A 

= c 2 #(TT A 

> c 2 #(T)- A , 



where c 2 = c |T ref | = er rcf (xp) 2 - This establishes the announced result (9.37) and concludes 
the proof of this proposition. In order to prove (9.42) we use an induction argument on 
the tree V, and for that purpose we distinguish two cases. 

i) If T is a leaf of V, then T G T' and therefore a (To) = |T | and n(T ) = 1, hence 
( [912] ) holds. 

ii) Otherwise To has two children T% and T2 in V', and we may assume as an induction 
hypothesis that 



|T I < a(T) n(Ti) A and |T 2 | < a(T 2 ) n(T 2 
Applying the holder inequality 

U\Vi + u 2 v 2 < (u[ + ul)p(vl + ti|)5 

to the numbers 



(9.43) 



Ui = a(Ti)p, Vi = n(Ti)i, p = 1 + A, g 
and elevating to the power p, we obtain 



1 + A _ p 

~ a' 



(a^utTi)*)* + (a(T 2 )n(T 2 ) A )M < (a(T) + a(T 2 )) (n(T) + n(T 2 )) x . (9.44) 
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Figure 9.6 - Bisection of a triangle, some of its children, grand-children and grand-grand- 



children, with decision function satisfying Assumptions 9.3.1 



If follows that 



< 
< 




\Ti\p + \T 2 \p 

(a(T 1 )n(T 1 ) A )^ + (a(T 2 )n(T 2 ) A )^ 

(a(T 1 ) + a(T 2 ))(n(T 1 )+n(T 2 )) A . 
a(T )n(T )\ 



where we use successively (9.38), (9.39), (9.43), (9.44) and the equalities a(T ) = 
a(Ti) + a(T 2 ) and n(T ) = n{Ti) + n(T 2 ). Again (9.42) holds, and by induction it 
holds for any element of V. 



o 



We now turn to the proof of the upper estimate (9.34), and for that purpose we begin 



with a geometrical lemma which uses the two assumptions (9.3.1 ) to analyse the first steps 
of the refinement procedure. 

Lemma 9.3.3. Let T be a triangle such that ct{xp) 7^ 0. // the decision function satisfies 



(9.3.1), then one of the following holds 



i) A child T' produced by the bisection ofT satisfies ct'(xp) = 0. 

ii) Applying iteratively the refinement procedure to T, some of its children, grand- children 
and grand- grand- children we obtain a triangulation TofT such that 

- #(T) < 5 ; and \T'\ > ±\T\ for all T e T. 

- The approximation error eT>{xp) non-zero for at most two triangles T' G T. 
Denoting these triangles by Ti,T 2 , one of the following two possibilities holds 



\T\ 



8|T 2 



2|Ti| = 16|T 2 | or \T\ = 4|Ti| 
Proof: Rather than reading a long discussion, the author invites the reader to check that, 



if i) is impossible, then one of the two possibilities illustrated on Figure 9.6 occurs and 
satisfies ii). 
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Proposition 9.3.4. Let T/v be an arbitrary triangulation in M 2 and let (Tn)n>n be the 
sequence of triangulations generated by the greedy algorithm applied to the function f = Xp 



and starting from the triangulation %. If the decision function satisfies Assumptions 9.3.1 
then 

er N (Xp)<CN- x / 2 , 
where A = 1.464 • • • is defined by (9.38). 



Proof: A proof of this proposition can be obtained by a straightforward adaptation of 



Lemma 9.3.7 and Proposition 9.3.8 in the next section, which is left to the reader. 



o 



9.3.2 Six bisection choices 



The previous section contains a negative result, Proposition 9.3.2 showing that the 



greedy refinement procedure does not achieve the desired convergence rate (9.33) when it 



is based on the three bisection choices Sj- and a decision function satisfying the natural 



assumptions 9.3.1 



In this section the segment along which a triangle T is bisected is picked among the 



six possibilities in Sy, illustrated on Figure 9.2, instead of the three possibilities in Sj< 



If the decision function satisfies Assumptions 9.3.5 then the desired convergence rate 



(9.33) is achieved, see Proposition 9.3.8 This result can be seen as a first step towards 
the construction of a hierarchical and anisotropic refinement algorithm well adapted to 
the approximation of cartoon functions. 

The following assumptions on the decision function dx(s, f) are illustrated on Figure 



9.7 Their purpose is to create, through the refinement process, a thin layer of triangles 



along the line D. Note that for any triangle T any for any segment s G S^, the two 
children of T have areas ||T| and 

Assumptions 9.3.5. Let T be a triangle such that ct{xp) 7^ 0. In the following we 
work under the assumption that segment s 6 which minimizes the decision function 
dr(s,Xp) satisfies the following properties. 

a) If there is such a possibility in S®, the bisection of T along s creates a child X" such 
that ex'ixp) — and \T'\ = \\T\. 

b) If a) is impossible, then whenever there is such a possibility in Sj>, the bisection of T 
along s creates a child T' such that ex'ixp) = an d \T'\ = \\T\. 

c) If a) and b) are impossible then the decision function selects a segment s = [vi,b] G S® 
joining a vertex V\ of the triangle T to the barycenter b = 1^2+1^3 of the other vertices. 
The choice of s must be such that the line D intersects the segment [b, 1)3]. 

The next lemma uses these three assumptions to analyse the first steps of the refine- 
ment procedure on a triangle T. 

Lemma 9.3.6. Let T be a triangle such that ct{xp) 7^ 0. If the decision function satisfies 



(9.3.5), then one of the following holds 



i) A child T' produced by the bisection ofT satisfies &t'(xp) = 0- 
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Figure 9.7 - Illustrations of Assumptions 9.3.5, a) left, b) center and c) right. The line 



D (Thick), forbidden bisection choices (dashed), authorized bisection choices (full). 



ii) Applying iteratively the refinement procedure to T , some of its children, grand- children 
and grand- grand- children we obtain a triangulation TofT such that 

- #(T) < 7, and \T'\ > £\T\ for all T G T. 

- The approximation error e^/(xp) is non-zero for at most two triangles T' G T . 
Denoting these by Ti,T 2 , there exists i G {0, 1,2} such that 



\Ti\ 
\T\ 



1 

< - x 
~ 3 



and 



T 2 2 
T ~ 9 



(9.45) 



Proof: Rather than reading a long discussion, the author invites the reader to check that, 



if i) is impossible, then one of the three possibilities illustrated on Figure 9.8 occurs and 
satisfies ii). 



o 



Let T r be a triangle such that erXxp) 0- The decision function applied to T r and its 
descendants defines a master tree V r of triangles. If V is a finite subtree of V r , (in other 
words V contains T r , and the nodes of V have either two or no children), then the leaves of 
V define a triangulation T of T r . The next lemma describes some of these triangulations. 

Lemma 9.3.7. For any < e < 1, there exists a triangulation % of T r associated to a 
finite subtree ofV T , which satisfies #(%) < Ce^^ and 



max{|T| ; T eT e and er(xp) 7^ 0} < £ \T r 
where C = 30 and fi = 0.329 • • • is the solution of 



4 

27 



+ 



(9.46) 



(9.47) 



As a result, the master tree V r contains at most Ce M triangles T such that ct{xp) 2 ^ 
e\T r \. 

Proof: We consider a second tree V' r of triangles, with root T r . A triangle T EV' r such 
that exixp) = has no children. A triangle T e V' T such that erixp) has at most 7 



0. 



children, those described in Lemma [9. 3. 6| Two cases are possible : 

i) The triangle T has two children and one of them, denoted by T", satisfies eT>{xp) 
The child T" of T is therefore a leaf of the tree V' r . 

ii) The triangle T has at most 7 children, and at most two of them 7\, T 2 are not leaves 



of the tree V' r . Furthermore the areas of T 1; T 2 satisfy (9.45) for some % G {0, 1, 2}. 
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Figure 9.8 - Bisection of a triangle, some of its children, grand-children and grand-grand- 



children, with a decision function satisfying Assumptions 9.3.5 



We denote by V' £ the subtree of V' r created as follows : we start from the root T r , and 
we include the children of a triangle T if and only if e-r(xp) 7^ and |T| > e\T r \. We 



define % as the collection of leaves of V' £ and we remark that (9.46) is satisfied. Note that 



the parent T G V' £ of any triangle T' G V' £ satisfies |T| > e\T r \. Lemma 9.3.6 therefore 
implies that 

|T'| > for any T G V' E . 



3 4 

We associate to each triangle T G V' £ the number 

n(T ) := #{T G % ; T C T }, 
and we intend to show that for any triangle T in the tree V' £ one has 

" |To| 



(9.48) 



n(T ) + 5<C 



e\T r 



(9.49) 



Once this point is established, choosing To = T r concludes the proof of the first part of this 
proposition. For that purpose, we proceed by induction on the tree V' e . We first remark 



that for any leaf T of V' £ is follows from (9.48) that 



C 



e\T r \ 



> C 



> 6 = n(T ) + 5. 



If Tq is not a leaf of V, then we may assume as an induction hypothesis that (9.49) 



holds for all the children of To. We now distinguish between the two types of nodes i) and 
ii) of the tree V' £ , and we obtain the following. 

Type i) The triangle To has two children, Ti, T2, where T2 is a leaf of the tree V' e . Since 
T is not a leaf of V' £ , it satisfies |T | > e|T r |. Furthermore |Ti| < l|T |, hence 



C 



go] 
e\T r 



n f 2 \ T °\ 
= C - x J — L , 

\3 e\T r \ 

\e\T r \ 
> r2(Tx) + 5 + l 
= n(T ) + 5. 



C 1 



\T \ 

e\T r 



C 1 



2 \ / 
3 



Therefore T satisfies (9.49). 
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Type ii) The triangle To has at most seven children, and at most two of them, denoted 
by Ti, T 2 , are not leaves. Hence n(T ) < n(Ti) + n(T 2 ) + 5. Using the estimate 



(9.45) on the areas of Ti and T 2 , and the definition (9.47) of /i, we obtain 



Therefore 



\Ti\ 
IToI 



< 1. 



C 



|T n 



e TJ 



> C 



\Ti\ 



+ C 



I To 



e\T r \J \e\T r 

> n(Tx) + 5 + n(T 2 ) + 5 

> n(T )+5, 



which concludes the proof of (9.49). 



We now turn to the second part of the proposition, and for that purpose we denote by V £ 
the subtree of V r associated to the triangulation %. Note that this tree is distinct from 
the subtree V' £ of V' r . In particular V £ is a binary tree while V' e is not (the tree V' e can 
be obtained by grouping some of the elements of V £ in a single node with some of their 
descendants) . 

Since V £ is a binary tree, the cardinality of the set V £ \ T £ of its inner nodes has a 
simple expression 

#(7> £ \T e ) = #(T £ )-l. 

Furthermore, any triangle T G TV which does not belong to V £ \ % is an element of T £ or 
one of its descendants, hence T satisfies erixp) = or er(xp) 2 < |T| < e|T r |. Therefore 

#({T G P r ; e T ( X p) 2 > e|T r |}) < #(P £ \ %) = #(T £ ) - 1 < Ce~^ 

which concludes the proof. 



Proposition 9.3.8. Let 7jv be an arbitrary triangulation in M 2 and let (Tn)n>n be the 
sequence of triangulations generated by the greedy algorithm applied to the function f = Xp 
and starting from the triangulation %. If the decision function satisfies the assumptions 



9115] then 

e TN (Xp) < CN-", (9.50) 

where v = ± - \ = 1.0188- •• . 

Proof: Let To be a triangle, let / G L 2 (T) and let Ti, T 2 be the children produced by 
some bisection of T. Then 

e Tl (ff + e T2 (ff = inf ||/-7r 1 ||| 2(Tl) + ||/-7r 2 ||| 2(T2) 

7Ti ,7T2 1 Jr 1 

= e To (/) 2 . 
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Hence 

max{e Tl (/),e T2 (/)} < e To (f). 

The decision function applied to the triangles in % and their descendants defines a collec- 
tion Vo of Nq master trees. It follows from the previous inequality that the approximation 
error Ct{xp) is larger on a triangle T than on any of its descendants. 

At each step N > N , the greedy algorithm selects for bisection a triangle in T N G 7jv 
which maximizes the approximation error ct{xp) among all T G T/v- As a result we have 
for any N > N , 



e T No (Xp) > e T Nn+1 (Xp) > ■ > ct n (xp) > max{e T (xp) 5 T eVq\ {T, 



AToj 



We thus have for any N > N , since #(7/v) = N, 



er N (xp) 2 = yZ 6 t(xp) 2 < N e Tjv (xp) 2 



(9.51) 



According to Lemma 9.3.7, for any < e < 1 the collection P of N trees contains at 

most CqE~l l triangles T such that erixp) 2 — £ 5 where Co is independent of e. Choosing 

i i 
e = C£ (N — N + 1) S we obtain 

e TN (Xp) 2 <cl(N-N + l)-l. 



Combining this with (9.51) yields (9.50) which concludes the proof. 



Remark 9.3.9. The convergence rate obtained in Proposition 9.3.8 actually exceeds the 
desired error estimate (9.33). This point is due to the fact that f = xp is a particularly 



simple cartoon function. For a function cartoon f which is discontinuous along curved 
edges, instead of a straight line, or which is not piecewise affine, the convergence rate 



(9.33) is optimal. 



9.3.3 Partitions of a domain into convex polygons 

We suggest in this section another modification of the refinement procedure, in an at- 



tempt to improve the result of Proposition |9.3.8[ This proposition is indeed not completely 
satisfactory, in particular because of the following three reasons. 



The decision function g?t(s, /) needs to satisfy Assumptions 9.3.5 Unfortunately 
numerical results suggest that the natural decision function (9.30) does not satisfy 



these assumptions, and the author has not found a decision function with a simple 
expression which satisfies these assumptions. 

- The number of segments s G along which a triangle T may be bisected has 
increased from 3 to 6. From the point of view of data compression, the hierarchical 
bisection tree is therefore more expensive to encode. From the theoretical point of 
view, the algorithm could be regarded as less elegant. 

- Since the cartoon function / = xp is particularly simple, see Remark 9.3.9| a conver- 



gence rate faster that (9.50) could be expected. For example an exponential conver- 



gence rate e-j- N (xp) < Ce^ N where (3 > 0. 
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The modification of the algorithm described below solves these three points, at the 
following cost. The modified greedy algorithm does not produce triangulations any more, 
but partitions into convex polygons of the domain Q on which the function / to be 
approximated is defined. 

We denote by C the collection of bounded closed and convex sets of nonempty interior 
included in M 2 . We denote by the capital letter T the elements of C, whether they are 
triangles or not, and by T any partition of a domain Q into elements of C. The definitions 



(9.27) and (9.28) of the approximation errors er(f) and er(f) are unchanged. 

Let T G C be a bounded closed and convex set, and let a, b, c G dT be three distinct 
points on the boundary of T. If none of the segments [a, b], [b,c] or [c, a] is included in 
dT, then the complementary in T of the triangle of vertices a, b, c has three connected 
components. We denote by T a the closure of the connected component which does not 
contain a, and we define T b and T c likewise, see Figure |9.9| The bisection of T along 
the segment [b,c], [c,a] or [a, b] produces a pair of children {T a ,T \ T a }, {TJ,,T \ T&} or 
{T c , T \ T c } respectively which also belong to C. We define 

r(T; a,b,c) := mm{\T a \,\T b \,\T c \} 

and we set the convention r(T; a, b, c) = for any a, b, c G dC such that the complemen- 
tary in T of the triangle of vertices a, b, c has only two connected components. We also 
define 

t(T) := sup r(T; a,b,c). (9.52) 

a,b,c£dT 

The next proposition gives a lower bound for the ratio r(T)/\T\. 



Proposition 9.3.10. The supremum (9.52) is attained and for any a,b,c satisfying 



r(T; a,b,c) = t(T), we have \T a \ = \T b \ = \T C \ = r(T). Furthermore for any T G C, 
one has 

\T\ < 7r(T). (9.53) 
Proof: For any fixed T G C, the boundary dT is compact and the map 



(a, b, c) G (dT)' 3 h->- r(T; a, 6, c) 
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V 



P 



b' 



P 



Figure 9.10 - Left : notations of Proposition 9.3.10 Center : three triangles contained 
in the convex sets T a , T b and T c , Right : three triangles contained in T a /, T b > and T c i. 



is continuous. Hence the maximum (9.52) is attained. Furthermore let a,b,c G dT and 



let us assume that \T a \ > \T b \, then moving the point c G dT in the direction of T a 
continuously decreases \T a \ and increases \T b \. As a result, if the triplet (a,b,c) G (dT) 3 



maximizes (9.52), then one must have \T a \ 



= IT I 

z + zq, where G GL 2 and Zq G M. 2 , one 



For any affine change of coordinates &(z) 
easily checks that 

r(T) _ r($(T)) 
\T\ ~ \$(T)\ ' 

Let T G C and let (a, b, c) G (dT) 3 be a triplet maximizing (9.52). In order to establish 



(9.54) 



(9.53) we may assume using (9.54), up to an affine change of coordinates, that the points 



a, 6, c are the vertices of an equilateral triangle T eq centered at the origin O = (0, 0) G 



and of edge-length 2 = \a — b\ = \b — c| = |c — a\. See Figure 9.10 for an illustration of 
this triangle and of the notations defined below. 

We denote by a' the point on dT such that O G [a, a'}, and we denote by a the distance 
from a' to the segment [b, c\. Since the convex set T a contains the triangle of vertices a', b, c, 
we have 

\b — c\ a , 

1 (9.55) 



T > 



a. 



We define 6', d and /3,7 similarly. The triplet (a',b',d) G (dT) 3 defines three convex sets 
T a i , Ty , T c / . Recalling that 

r(T) = r(T; a, b, c) = \T a \ = \T b \ = \T C \ 

> r(T; a',b',d) = min{|T a /|, \T b >\, \T d \}, 

we obtain 

t(T) > max{|T a |, \T b \, \T C \, min{|T a /|, \T V \, \T C ,\}} (9.56) 
Since \T a /\ contains the triangle of vertices a, b', d, an elementary computation shows that 

> \det(V-a,d-a)\ = \y/3(l - fhj) + g + 7 | 



hence 

a, /3, 7, + - min{/3 + 7 — Vsfl'y, 7 + a — \/37a, a + — V?>af5} 
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Figure 9.11 - Left : A convex set (red, dashed), the segments S^* (blue, dotted), the line 
D (black, thick). Right : After 8 steps of the greedy algorithm, with a decision function 



satisfying Assumptions 9.3.11 



If the minimum appearing in the previous equation is negative, then a, (3 or 7 is lar- 
ger than l/\/3 > v3/4. It follows that r(T) > v^3/4, which concludes the proof since 
\T\ = |T eq | + 3r(T) = V3 + 3r(T). 



o 



We denote by S^* '■= {[a, b], [b,c], [c,a]} the collection of segments joining the three 
points a,b,c G (<9T) 3 which maximize (9.52). Some convex polygons and the associated 



optimal triplets (a, b, c) G (<9T) 3 are illustrated on Figure 9.9 If there are several triplets 



(a, 6, c) realizing this maximum, for instance if T is a disc or a regular polygon, then we 
consider the minimal triplet with respect to the lexicographic order on (K 2 ) 3 . 

From an algorithmic point of view, for any convex polygon T G C the points (a, b, c) 



which maximize (9.52) can be found by solving a finite number of third degree equations 



(one for each triplet of distinct faces of T). Note that Proposition 9.3.10 is not optimal. 



We conjecture that the minimal value r{T)/\T\ is attained when the convex set T is 
the unit disc, and is equal to (7T/3 — \/3/4)/7r ~ 0.195 ■ ■ ■ , which is stricly larger than 
1/7 ~ 0.142- ••. 

Assumptions 9.3.11. Let T G C be such that e-r(xp) 7^ 0. In the following we work 
under the assumption that the segment s G S^* which minimizes the decision function 
dr{ s i Xp) satisfies the following property. 

a) The bisection of T along s creates a child T' such that er'(xp) = 0. (This is always 
possible). 

The following corollary shows that, under the previous assumption on the decision 
function, the refinement procedure based on these bisection choices creates an exponen- 
tially thin layer around the discontinuity of the function / = xp, as illustrated on Figure 

Em 

Corollary 9.3.12. Let Q C IR 2 be a convex polygon, and let (Tn)n>i be the sequence of 
partitions of Q into convex polygons generated by the greedy algorithm applied to the func- 
tion f = Xp an d starting from T\ = {^}- If the decision function satisfies Assumptions 
\9.S.11\ then 

erAxpf^m-WY- 1 - 
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Figure 9.12 - Triangulations produced by the greedy algorithm applied to the function 
f(x,y) = x 2 + 100y 2 with the decision function (9.57) and 50 
Likewise for the function f(x,y) 



and 200 (ii) elements. 



x 2 + y 2 and 30 elements 



lii). 



Proof: It follows from the assumptions 9.3.11 that for all iV > 1 at most a single element 
T e Tn satisfies erixp) 0- We denote this element by T/v, and we remark that T/v + i 
is a child of Tjv- It follows from Proposition 9.3.10 that |Tjv+i| < (1 — l/7)|Tjv|, for any 
N > 1, hence 



e TN (Xp) 2 = e TN (Xp) 2 < \T N \ < (1 - 1/7) 



N-l 



which concludes the proof. See Figure 9.11 for an illustration 



The bisection of T G C along a segment s G 

of the notations previously used in this section, the bisection along s 

T n and T 2 



creates two children and T 2 . In terms 

[b, c] G creates 



the two children T} 
function dr(s, f) is defined by 



T \T a . From this point, we assume that the decision 



d T (s, f) := min 



mil 



IT 2 1 i 



(9.57) 



If the approximation error of / on or T 2 is zero, then o?t(s, /) = 0. Therefore this deci- 



sion function satisfies Assumptions 9.3.11 and leads to an exponentially fast convergence 
if / = Xp according to Corollary |9.3. 12 



Let us consider a quadratic function f(z) = -^zrQz 1 where Q is a 2 x 2 symmetric 
matrix, to be approximated on a domain Q. As seen in the previous chapters, the ele- 
ments of a partition T of Q should adopt, for an efficient approximation of /, an aspect 
ratio dictated by the hessian matrix d 2 f = Q of / (at least in an average sense). The 
numerical experiments presented in Figure |9.12 suggest that the greedy algorithm, based 
on the bisection choices S^* and the decision function (9.57), generates such partitions. 
Unfortunately, the author could not prove this property. 

Eventually, numerical experiments were also led for the characteristic function / = \d 



of the unit disc D = {(x,y) G 



p2 . 



X' 



+ y 2 < 1}- They seem to indicate that the partitions 



(Tn)n>i of the triangle of vertices (0,0), (2,0), (0,2) generated by the greedy algorithm 
do satisfy the desired error estimate 



e TN {XD)<CN-\ 
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Figure 9.13 - Triangulation produced by the greedy algorithm applied to the function 
/ = Xd-i with the decision function (9.57) and 200 elements (i). Detail (ii). Decay of the 
error cj- n {xd)i m logarithmic scale, for 1 < iV < 40000. 



see Figure 9.13 However, again, the author could not establish this property. 



For better numerical precision, the numerical experiments illustrated in Figures 9.11 



9.12| and |9.13| were conducted without discretizing function / of interest but using its 
explicit form in order to have an exact expression of the L 2 approximation error. 



9.4 A greedy algorithm based on rectangles 

We study in this section another variation of the greedy algorithm which is simpler 
than the previous ones. This algorithm produces partitions (1~n)n>i of the unit square 
Q = [0, l] 2 into rectangles aligned with the horizontal or vertical £1X6 S ? clS studied in 
Chapter 1 and illustrated on Figure [5] in the main introduction of the thesis, instead 
of triangulations built of triangles of arbitrary direction. Here we work with piecewise 
constant approximants, and we measure the approximation error in the L°° norm. These 
important simplifications allow us to establish an error estimate which applies to any 
function / G C 1 (f2) and is asymptotically optimal in accordance with the results in 
Chapter 1. We also establish a second error estimate which gives some valuable (yet non 
optimal) information in a non-asymptotic sense, i.e. for any value of N > 1. 

In order to state our main result, we introduce some notation. A rectangle T C K 2 is 
a set of the form 

T = I x J 

where I and J are compact intervals of R. For any rectangle T and any / in C°(T), we 
denote by ey(/) the difference between the supremum and the infimum of / on T. 

e T (f) := sup/-inf /. 

T T 



Note that er(/) is also the double of the error of best approximation of / on T in the L c 
norm, 

er(/) = 2inf ||/ - c\\ L oo {T} . 
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If a function / G C l (T) attains its maximum on the rectangle T at the point zm = 
(xm,Um), and its minimum at the point z m = (x m ,y m ) then observe that 



f(z M ) - f(z m ) < 
< 



x M 

d x f(x,y m )dx 



Dm 

d y f(x M ,y)dy 

Vm 



\x M - aJm|||3r/IU°°(T) + \y M - y m |||9 y /||i ( oo( T ). 

Therefore if T = I x J, we obtain 

e T (f) < |/|||^/|Uoc (T) + \J\\\d y f\\ L ~ {T) . (9.58) 

For any partition T of a rectangle T into smaller rectangles, and for any / G C°(T), we 
denote by e-j-(f) the maximum of er(f) among all T G T : 

er(/) : = maxe r (/). 



We clearly also have 



er(/) =2 miii ||/ - #||l~ ( t), 



where is the collection of functions on T which are constant on the interior of each 

T G r. 

Remark 9.4.1. Please note that the definition of ey(/) m £/ws section differs a lot from 
the definition of ' &r{f) in the previous section. Indeed, er{f) stands here for the double of 
the best approximation of f in L°°(T) norm by a constant c G R, whereas ey(/) stands in 
the previous section for the best approximation of f in L 2 (T) norm by an affine function 
7T G Pi. 

For any rectangle T we denote by (T^, T u ) the down and up rectangles which are 
obtained by a horizontal split of T through its barycenter and by (T;,T r ) the left and 
right rectangles which are obtained by a vertical split of T through its barycenter. For 
any function / G C°(T), we define 

er,h(/) :=max{e Td (/),e Tu (/)} and e T)W (/) := max{e T( (/), e Tr (/)}. 

The greedy algorithm studied in this section takes as input a function / G C°(Q), 
where Q = [0, l] 2 is the unit square, and a parameter p G [0, 1]. Given a partition T of 
f2 into rectangles aligned with the coordinate axes, the greedy algorithm produces in one 
step a refinement T' of T, satisfying #(T') = #(T) + 1, proceeding as follows. 

1. Select a rectangle T G T which maximizes er(/)- 

2. (Greedy split) If 

min{e T , ft (/), eT> (/)} < pe T (/), (9.59) 

then bisect the rectangle T horizontally if eT,h(f) < e T,v(f) and vertically otherwise, 
thus producing two children Ti,T 2 of T. Define 

r :=r-{T} + {T 1 ,T 2 }. 
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3. (Security split) If 



mm{e T ,h(f),e T ,v(f)} > pe T (f), 



(9.60) 



then bisect the rectangle T horizontally if \I\ > \J\ and vertically otherwise where 
/, J are the two compact intervals such that T = I x J. This bisection produces two 
children 7\ , T 2 of T and we define 

r :=T-{T} + {T l7 T 2 }. 

Starting from the partition 7i = {^} of the square Q = [0, l] 2 , the greedy algorithm 
produces step after step a sequence (Tn)n>i of partitions of fl into dyadic rectangles, 
satisfying #(T/v) = N. The parameter p G [0, 1] is chosen by the user. It should not be 
chosen too small in order to avoid that all splits are of safety type which would then lead 
to isotropic partitions. The choice p = 1 is not indicated either. Indeed let f(x,y) : = 
sin(47rx), then for any rectangle T of the form T = [0,1] x J, where J is a compact 
interval, one has 

e T (f) = er, h {f) = ^Af) = 2- 

If a rectangle T = [0, 1] x J is selected for bisection and if p = 1, then T is bisected 
horizontally and we are facing a similar situation for its children T u and As a re- 
sult, the greedy algorithm produces a sequence (Tn)n>i of partitions of Q consisting of 
rectangles of the form [0,1] x J where J are dyadic intervals. This shows that the ap- 
proximations produced by the algorithm fail to converge towards /, and the global error 
remains er N {f) — 1 f° r a h N > 1. On the contrary, it is established in 35 that for any 
choice of p G [0, 1[ and any / G C°(f2), the partitions (Tn)n>i produced by the greedy 
algorithm satisfy ej- N (f) — > as iV — > oo. 

We now prove that, using the specific value p = ^ and measuring the error in the 
L°° norm, the refinement algorithm has optimal convergence properties (9.61) similar to 



Theorem 1.1.5 The authors suspect that this result can be extended to the L p norm, where 
1 < p < oo, and to an arbitrary parameter p g]1/2, 1[, at the price of more technicalities. 

Theorem 9.4.2. There exists a constant C > such that for any f G C 1 (fi) ; the 
partitions (Tn)n>i produced by the modified greedy refinement algorithm with parameter 
p := ;4= satisfy the asymptotic convergence estimate 



limsup N^er N {f) < C 



dyf\ 



L-' 



Furthermore for all N > 1 one has 

e T Af)<C\\Vf\\L~(n)N-l 



(9.61) 



(9.62) 



The asymptotical error estimate (9.61) is the best than one should hope to obtain. 



Indeed it follows from Theorem 1.1.4 that there exists a constant c > independent of / 
and such that 



liminf N 2 er' (f) > c 



\9J dyf\ 



(9.63) 



L 2 
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for any admissible sequence (7n)n>n of partitions of fl into rectangles. (Admissibility is 
a minor technical condition, see Definition 1.1.3) 

Here and after, we use the £°° norm on R for measuring the gradient : for z = (x, y) G 

n 

\Vf{z)\ :=max{|<9,/(z)|,|«VWI}, 

and 

||V/|| L oc (T) := sup \Vf(z)\ = max {\\d x f\\ L °o(T), ||<VIU~(T)} ■ 

z&T 

We consider a fixed function / G C 1 (f2) and we denote by (Tn)n>i the sequence of parti- 
tions of the square Q = [0, l] 2 into dyadic rectangles generated by the greedy algorithm. 
Furthermore we define for each N > 1 

e{N):=e TN {f). 

Finally we sometimes use the n otatio n x(z) and y(z) to denote the coordinates of a point 
z G 1R 2 . The proof of Theorem 9.4.2 requires a preliminary result. 

Lemma 9.4.3. Let To = Iq x Jo G Tm be a dyadic rectangle obtained at some stage M > 1 
of the refinement algorithm, and let T = I x J G Tn be a dyadic rectangle obtained at 
some later stage N > M and such that T C T . We then have 

Proof: Since the coordinates x and y play symmetrical roles, it suffices to prove the first 
inequality. We reason by contradiction. If the inequality does not hold, there exists a 
rectangle T' — I' x J' in the chain that led from To to T\ which is such that 

< , (9-65) 

and such that T' is split vertically by the algorithm. If this was a safety split, we would 
have that | J'\ < and therefore using ( |9.58 ) 

eAf) < + \J'\)\\Vf\\ L ~(T) < 2|r|||V/|| L oo (T) < e(N), 

which is a contradiction, since all ancestors of T should satisfy ex'{f) > £ (N). Hence this 
split was necessarily a greedy split. 

Let z m := Argmin 2gT //(z) and zm '■= Argmax 2gT //(z), and let T" be the child of 
T' (after the vertical split) containing zm- Then T" also contains a point z' m such that 
\ x ( z 'm) ~ x ( z m)\ < 1^1/2 and y{z' m ) = y(z m ). It follows that 

e T >M) > e T »(f) 

> f( ZM )-f( z 'J 

\r\ 

> f{ZM) - f{Zm) ~ II^/Hloc^) — 

> eT;(/ ) _ I £(A r) 

> \e T >U) 
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where we used (9.65) to obtain the fourth line. The error was therefore insufficiently re- 



duced which contradicts a greedy split. o 



Proof of Inequality (9.62) We first observe that for any N > 1, 



e(N) < e(l) = sup / - inf / < 2||V/||r~ (n) . 



Applying Lemma 9.4.3 to the square f2 = [0, 1] x [0, 1] we obtain 

min{|/|,|J|} > ^ 

4|| V/|| L oo (n) 

for any rectangle T = / x J £ T N . Therefore 

:(X) 

T=lxJeT N 

hence 



\Q\= |/| X |J| > iV ( - 

T-fv TcT. r \ 



4||V/|U- (n) . 

N*e(N) < 4||V/|U» (n) , (9.66) 



which concludes the proof of (9.62). o 



Proof of Inequality (9.61) We consider a small but fixed 5 > 0, and we define h(8) as 



the maximal h > such that 

Vz,z'en, \z-z'\<2h(5)=>\Vf(z)-Vf(z')\<5. (9.67) 
For any rectangle T = I x J c fi, we thus have 

er(/) > (il^/IU^c^) - 5) min^C^), | J|} 3 

er(/) > (\\d y f\\ L ~ {T) -5)rmn{h(5),\J\}. [ ^ } 

Let 5 > and let M = M(f,S) be the smallest value of N such that e(N) < 95h{5). 
For all N > M, and therefore e(N) < 95h(5), we consider the partition Tv which is a 
refinement of 7m- For any rectangle T = J x J £ Tm, we denote by Tv(To) the set of 
rectangles of T/v that are contained T . We thus have 

T N := |J Tv(T ), 

and T/v(T ) is a partition of T . We shall next bound by below the side length of T = / x J 
contained in T/v (To), distinguishing different cases depending on the behaviour of / on T . 

Case 1. If T £ Tm is such that ||V/||loo(x ) < 105, then a direct application of Lemma 



9.4.3 shows that for all T = / x J £ T/v (To) we have 



|/|>min<||/o|,^} and | J\ > min [| J \, ^ } (9.69) 
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Case 2. If T e 7m is such that ||9 a ./||2,«>( To ) > 105 and ||f^/||£°°(To) > 105, we then claim 
that for all T = I x J £ T/v(T ) we have 



1/1 > min < l/i 



'o 



20||9 a; /|| L oo( ro ) 
and that furthermore 



and |J| > min < |Ji 



e(iV) 



' 2Q\\dyf\\L°°(T ) J ' 



10 



\d x f d y f\dxdy. 



(9.70) 



(9.71) 



T 



\T \ \\d x f\\ L °°(T )\\d y f\\L°°(T ) < 

This last statement easily follows by the following observation : recalling ( |9.68 ) we obtain 

95h(S) > e(M) > e To (f) > {\\d v f\\ L °°( T o) ~ S ) min{/ i (<5), |J |} > 98min{h(5), |J |}, 
therefore | Jo| < /i(<5). Similarly |/q| < ^(5) and therefore for all z G To we obtain 

9 

\d x f(z)\ > \\dJ\\ L oo (To) -6 > — ||9 x /||l»(To), 

9 

\d y f(z)\ > \\d y f\\ L oo {To) -5> Y^\\d y f\\ L ^ {To) . 



Integrating over To yields (9.71). Furthermore the previous equations show that d x f and 
d y f have a constant sign on T . Let us assume that both are positive, then for any rectangle 
T = [a, b] x [c, d] C T we obtain 

e T (f) = sup/-inf/ 

> f{b,d)-f{a,c) 



> 



d x f(x,c)dx+ / d y f(b,y)dy 



9 



> 10 (( &_a )II^H ioo ( T o) + ( d - c )\\ d yfh^(T )) 



A similar reasoning can be applied if d x f or d y f is negative is negative on T . Recalling 
(9.58) we therefore obtain for any T C T 



9 
10 



< 



e T (f) 



\d x f\\ L <~(T )\I\ + ||^/IU-(T )|^| 



< 1. 



(9.72) 



Clearly the two inequalities in (9.70) are symmetrical, and it suffices to prove the first 
one. Similar to the proof of Lemma 9.4.3[ we reason by contradiction, assuming that a 
rectangle T' — I' x J' with 

\I'\W\\i^m<^ (9.73) 
was split vertically by the algorithm in the chain leading from T to T. A simple compu- 

\\d x f\\ L ~ m \r\ 



tation using (9.72) shows that 



-FTT- < —TTT < - X — With U \- 

e T >(f) ~ e T ,Jf) " 9 l + tr/2 



IVII 



W l|L°°(To) 



J' 
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In particular if a < 0.2 the we obtain that 

&p,h{f) < per>(f) and e T ',h(f) < e T >, v (f), 

hence the algorithm performs a horizontal greedy split on T', which contradicts our as- 
sumption. Therefore a > 0.2, but this also leads to a contradiction since, using (9.73), 

e(N) < e TI (f) < \\d x f\\ L ~ {To) \I'\ + \\d y f\\ L ~ {To) \J'\ < (1 + < £ W 



Case 3. If T G Tm is such that HSr/IU^To) < 105 and ||0j//||l°°(t o ) — 105, we then 
claim that for all T = I x J G T/v(T ) we have, with Cq := 200, 



III > min < I Jq 



C 6 J 



and | J\ > min < \J \, 



e(N) 



4IIV/IL 



(9.74) 



with a symmetrical result if T is such that ||c?x/||i°°(T ) > 105 and ||^//||£°°(t ) < 105. 
The right part of (9.74) is a direct consequence of Lemma 9.4.3, hence we focus on the 
left part. Applying the second inequality of (9.68) to T = T , we obtain 

95h(8) > e To (f) > (\\d y f\\ L oo(T ) - 5)min{/i(5), |J |} > 95min{/i(5), |J |}, 

from which we infer that |Jq| < h(S). If Z\,z 2 G T and x(z\) = x(z 2 ) we therefore have 
\d y f{zi)\ > \d y f(z2)\ — 5. It follows that for any rectangle T = I x J c T we have 

(R/||l~CT) - 5)1 J| < e T (f) < R/|| Loo(T) |J| + 105|/|. (9.75) 

We then again reason by contradiction, assuming that a rectangle T' — I' X J' with 

2e(JV) 



l/'l < 



C n 5 



(9.76) 



(T') 



< 



was split vertically by the algorithm in the chain leading from To to T. If U^/Hlo 
105, then ||V/||£°a(T') < 105 and Lemma 9.4.3 shows that T' should not have been split 
vertically, which is a contradiction. Otherwise \\d y f\\L°°(T') — 5 > thII^/ |U°°(T')> and we 
obtain using (9.75) and (9.76) that 

9 2e(N) 
^\\d y f\\L~(T>)\J'\ < e T >(f) < \\d v f\\ L »<T>)\ A + 105 1 

Recalling that eT>{f) > s(N) we obtain 

20 



C n 5 



1 



Co 



(9.77) 
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We now consider the children T' v and T' h of X" of maximal error after a horizontal and 



vertical split respectively. We obtain 

e T ',h(f) = e u(f) 



< \\d y f\\L°°(T>)-Y + mi'\ 



Co 



< 



5 20 
9 + Co 



59 



where we used (9.75) in the second line, and (9.76) and (9.77) in the third line. On the 
other hand, using (9.77) in the fourth line 

e T 'M) = e T' h U) 

> (||<V|U~(T') - S)\J\ 
9 

10 1 
9 



> 
> 



\dyf\\L°°{T')\J | 



10 



1 



20 

Co 



eAf) 



100 



e T >(f). 



Therefore eT>. v {f) > ^T',h{f) which is a contradiction, since our decision rule would then 
select a horizontal split. 



We now recall that e(N) — > as N — > oo, see (9.66), and we choose N large enough 
so that the minimum in (9.69), (9.70) and (9.74) is always equal to the second term. For 
all T G T/v(T ), we respectively find that 



\T\ 



5 2 



if ||V/|| Lo = (To) < 105 



<° 2 \ W\k\ d *f d vf\ if ll^/|Uoc (To ) > 105 and || VlU-(To) > 105 



^Iiv/IL 



if \\d x f\\L°°(T ) < 105 and \\d y f\\ L oo {To) > 105 (or reversed). 



with C 2 := max{40 2 , 20 2 (10/9) 2 , 800} = 40 2 . For z G fi, we set ^ N (z) := ± where 
T G Tn is such that z G T, and we obtain 



N = #(T N ) = [ ^ N {z)dz < C 2 e(NY 



\d x f d y f\dxdy + 5\\Vf\\ L ~+5 r ' 



Therefore 



limsup N * e(N) <C [ / \d x f d y f\dxdy + 5|| V/||l» + 5' 
Taking the limit as 5 — > 0, we obtain the announced result 



lim sup N2 £ {N) < C 

N-¥oc 



dxfdyf\ 



L 2 



which concludes the proof. 



□ 
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9.5.1 Proof of Proposition 9.2.10 



Let q be a quadratic function of mixed type, and let T be a triangle with edges a, b, c 
such that |q(a)| > |q(6)| > |q(c) | . Up to a linear change of variables, we may assume 
that the quadratic part of q is q(x,y) = x 2 — y 2 . Up to a translation, linear rescaling 
and permutation between the x and y coordinates, we may assume that the edge a is 
centered at and such that q(a) = 4. We write | = (u,v), and assume that u > 
without loss of generality. Then 1 = q(|) = u 2 — v 2 , and there must be 6 G M. such that 
(u,v) = (cosh(6 l ), sinh(6 1 )). 

We next observe that the linear transformation of matrix 

cosh(6 l ) — sinh(#) 
— sinh(6 l ) cosh(6>) 

leaves q invariant, in the sense that q o = q, and satisfies 0(f) = (1, 0). Up to such a 
transformation, we may therefore assume that a = (2,0). Therefore the two vertices of 
T at the ends of the edge a are (—1,0) and (1,0). We denote the third vertex by (s,t). 
There is no loss of generality, eventually, in assuming that s > and t > 0. Note that 
\T\ = t and p q (T) = |. 

We now specialize to the case where p q (T) > 4, which is equivalent to t < 1. Recall 
that the edges of T are such that |q(a)| > |q(&)| > |q( c )|- The following lines show that 
q(l + s,t) > |q(s — l,t)|, which implies that b = —(1 + s,t) and c= (s — l,t) : 

q(l +s,t)- q(s - 1, t) = As> 0, 

q(l + s,t) + q(s-l,t) = 2(l + s 2 -t 2 ) > 0. 

In addition we see that q(6) > and we thus have proved that q(a)q(6) > 0. For the 



second and third inequality in (9.19) we remark that q(c) = (s — l) 2 — t 2 and q(cf) = s 2 — t 2 . 
Clearly 

-1 < -t 2 < min{q(c),q(rf)}. 
If < s < 1, we also clearly have 

max{q(c), q(c?)} < 1. 

If s > 1 we have 

q(c) < q(d) = s 2 - 1 2 = (s + l) 2 - t 2 - (2s + 1) = q(6) - (2s + 1) < q(a) -3 = 1. 
We thus always have 

max{|q(c)|,|q(rf)|}<^ = l, (9.78) 



which concludes the proof of the inequalities (9.19). 



Last, we specialize to the case where p q (T) = t > 8, equivalently t < ^, to prove 



(9.20) : 



q(6) + q(c) = (s + l) 2 - t 2 + (s - l) 2 - t 2 = 2 + 2s 2 - 2t 2 > - = |q(a) 

2 8 
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9.5.2 Proof of Proposition [9^8 



For any triangle T we denote by T x the interval denned as the projection of T on the 
x axis, and by \T X \ its length. We denote by It and It x the two dimensional and one 
dimensional local interpolations operators on T and T x respectively. It is clear that if g is 
a C 2 convex function of one variable and if G(x,y) := g(x), then 

\\G — ItG\\l°°(T) = \\g — It x 9\\l°°(t x )- (9.79) 

The following lemma compares the interpolation error on an interval of R and on a sub- 
interval. 

Lemma 9.5.1. Let g E C 2 (IR, R) be such that < m < g" < M. Let xi,X2,x$ be real 
numbers satisfying x% < Xl + X3 < x 2 < x 3 , and denote u := X2 — X\ < v := X3 — x\. Then 
denoting by I u and I v the interpolation operators on the intervals [£i,£2] one? [xi,^] 
respectively, we have 

\\g ~ Ivg\\L°°([ xl ,x 3 )) > mv 2 /8 



and 



\g - Iv9\\l°°([xi,x 2 ]) < ( 1 - a I \\g - Iv9\\l°°([xi,x 3 ]) 



with a := \\JmjM . 

Proof: Let us define h v = I v g — g. Since h" = —g" and h v {x\) = h v {x^) = 0, h v can be 
represented as the integral 

h v (x)= / K v (x,y)g"(y)dy. (9.80) 

J Xl 

where the Green kernel K v is given by 

1 / (x - xi)(x 3 -y) if x < y, 



K v (x,y) 

v 



(x 3 -x)(y-x 1 ) iix>y. 



Of course, we have a similar representation of h u = I u g — g with a kernel K u . 
The first part of the proposition immediately follows from 

h v ^ J >m J K v ^ ,y jdy>mv /8. 

In order to prove the second part, we shall compare the Green Kernels K u and K v . For 
this purpose, we define 

(x 2 - x)/u 
(x 3 - x)/v 

For all x, y e [xi,^], we thus have 

I 77 L r 1 V \ = min{/x(x),/x(y)} < fi(x). 
K v {x,y) 
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Therefore, defining x u := argmax^^^^/ug — g)(t) and using (9.80), we obtain 

\\g — Iug\\L°°([xi,x 2 ]) — h u (x u ) 

= f** K u (x u ,y)g"(y)dy 

< K x u) f x * K v {x u , y)g"(y)dy 

< K x u) LI K v(%u, y)g"(y)dy 

< K x u)\\g - Ivg\\L°°([xi,x 3 ])- 

In order to conclude, we need to estimate fi(x u ). One easily checks by differentiation that 
fi is decreasing and concave on the interval [xi,^], and therefore for all x G [xi,^] 



n(x) < 1 — (x — xi) 



v — u 



Differentiating the integral representation of h u , we obtain 



uh' u (x u ) = - (y- x 1 )g"(y)dy + (x 2 - y)g"(y)dy 



Since h'(x u ) = and < m < g" < M, we obtain 

(x 2 - Xufm < (x u - xi) 2 M. 
Since x 2 > Xl ~! 2 X3 , this gives x u — x\ > y/m/M^. Therefore, 

-v — u 



fi{x u ) < 1- --s/m/M- 



which concludes the proof. 



o 



The following corollary uses the above lemma to compare the values of the L°°-based 
decision functions for a convex function that depends only of one variable. For any vector 
v G M 2 we denote by v x the absolute value of its x coordinate. 

Corollary 9.5.2. Let T be a triangle with edges a,b,c, such that a x > b x > c x . Let 

G(x,y) = g{x), where g G C 2 and < m < g" < M. Then, with dr defined by (9.9), 

d T {b,G) - d T (a,G) > Ca x (a x -b x ), 
d T (c,G)-d T (a,G) > Cal/2, 



with C 



,3/2 



32VM 



Proof: We recall the notation a T (f) ■= \\f — IrfWh™- It follows from (9.79) that 

a T {G) = \\g - lT x g\\L°°(T x )- 



We label the extremities of a by i G {1,2}, and denote by T e l , i G {1,2}, e e {a, b, c} the 
child of T resulting from the bisection through the edge e in such a way that T* contains 
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the extremity of a of label i. We denote by T\ x the projection of T\ onto the x-axis. Then 
(up to exchanging the labels of the extremities of a), 

|-^a,xl = \T a ,x\ = a a:/2, 

\^b,x\ = a xi l^6,J = C x J rb x /2, 

I^J = 6x + ^A |T 2 J = a x . 

In particular, we have T% C T b 2 and C T£ x and therefore 

«^(G) = \\g - I Tlx g\\ Laa{ Ti x) < \\g - I T 2j\\ L ^ {T 2j = a T 2(G), 

and similarly at^i (G) < a^i (G). Moreover, we can apply the previous lemma with [x\, x?\ = 
C T^ x = [xi, x 3 ] or with [xi, x 2 ) = T% x C T% x = [xi, x 3 ] which respectively leads to 

a T i(G) — ari(G) > 1 =a x {a x — b x ), 



and 

a T *{G)-a n {G)>^=al/2. 

This allows us to conclude since dx(e, G) = a T \ + a T 2, for s G {a, b, c}. 

Using the above result, we now prove that the decision function tends to prescribe 
a longest edge bisection with respect to the euclidean metric when the triangle T becomes 
too thin. 

Corollary 9.5.3. Let f be a convex function such that mid < d 2 f < Mid, and let T be 
a triangle with measure of non-degeneracy cr(T) for the euclidean metric and edges a, b, c, 
such that \a\ > \b\ > \c\. Then if a IT) > 2K the bisection prescribed by the decision 
function dr{-, /) is a S-near longest edge bisection with respect to the euclidean metric (in 
the sense of definition 8.4.3), with K = 128(^) 3/2 and 5 := JL. 

Proof: We denote by p(X) the affine orthogonal projection of a point X G 1R 2 onto the 
line which includes the edge a, and we denote by O the midpoint of a. We then define 

f(X) = f(p(X)) + df (X-p(X)). 

Then, using the notation X(z) = p{X) + z(X — p(X)), we have 

f(X)-f(X) = f(X)-f(p(X))-df (X-p(X)) 

(df x{z) -df )(X-p(X))dz. 

But for X G T, we have X(z) G T for all z G [0, 1], so that \X(z) — 0\ < \a\ and therefore 

\\df X (z)-dfo\\ <M\a\. 
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We also have 

|x-p(x)i<311 < 



a(TY 



Therefore, ||/ — f\\L°°(T) < M^L. We can apply the previous lemma to the function /, 
since it is the sum of a function of one variable and of an affine function which has no 
effect on the interpolation error. Assuming without loss of generality (up to a rotation) 
that a is parallel to the x axis, this gives us 

d T (bJ)-d T (aJ) > C\a\(\a\-b x )>C\a\(\a\-\b\), 
d T (cJ)-d T (aJ) > C\a\ 2 /2. 

3 /2 

with C = ^rj^ - Since the Lebesgue constant of the interpolation operator on any triangle 



is 2, we have 

|dr(e ) /)-d r (e ) /)| < 4||/ - /|| L cc (T) 

which implies the following inequalities 

d T (bJ)-d T (aJ) > C\a\(\a\-\b\)-4M\a\ 2 /a(T), 
d T (cJ)-d T (aJ) > \a\ 2 (C/2-AM/a(T)). 

The second inequality shows that the edge c cannot be cut if C/2 — 4M/a(T) > which 
is equivalent to <j(T) > 2K. The first inequality shows that b may be cut provided that 
C(\a\-\b\)-AM\a\/a(T) < 0, i.e. \b\ > (1 - ^y)|a| which shows the property of 5-near 

longest edge bisection with 5 := ^tL-. o 



The proof of Proposition 9.2.8 directly follows from this last result, proceeding exactly 
as in the last paragraph of the proof of Proposition 8.4.10 
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The 2-d anisotropic mesh generator BAMG : www.freefem.org/ff++/ (included in 
the FreeFem++ software) 



FreeFem++ software, developped by Frederic Hecht,www.freefem.org/ff++/ 



A 3-d anisotropic mesh generator : www. math. u-bordeauxl . f r/~dobj/logiciels/ 
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